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Genetic Variants 
Increase the Risk of Age-Related 
Macular Degeneration 



[01] '{ his mveTitii.'R wfis r-iadc ut.mg fui Jo 'x^ ir L S ^< \ vnvi<?nt grant no T U0£"^ 0 12 M i^. 
and r\ 0 IMJ 6 itmk th^ Xaiiom.l I -i' n iflo i tj ' MH ;/Nat)onaJ E^ e tostimie and 
b> gianT ACt! froiiti the NlH'Kduonal In&tttiite on Aging and b> RK 00095 fiom 
the National iintitujes of Health GCRC. Therefore the U.S. govcmmeiu retains 
certain rights in tbe iaveatioii. 

TECHNICAL HELU OF THE imTENTION 

£021 This inveetioB is related to the area of genetic teslmg, drag discovery, and Age- 
Related Macular Degeamttion, Isx particular, it relates to genetic vsaittats wbich 
ijicrease Ifae risk of Age-Rdated Macular Degeueralion, particularly in combination 

with certain behavior. 

BACKGROUND OF THE IISVENTIOIS 

103} Age-reidted ma>.ul<ii tU|ei!ci a (.'\MD"i ^a^-^ [u^t ssi ^ impatnnent of ctntidl 
JSiUii aud j:> the Icadiiis {Use of w. tw.h^ \ s<u\ -tndtki \mt.ni iiis (?) Ihe 
rao&i bi^^eie lonu o! \\[D in\ol t ^ n >, \ v*. ^1 s \ul *i\t. (ut i) and Oi atiopkic fdn) 
chaiiges to die maajk ^ 'S p s t»n vSs !< ^ of t ^! iargtl)- nnkjiovti, 

imf iicdted ns.k bctoii. UKUit l ^ti ctH)(v.ir ^-^ -^k i)£ h^pet tenssun obe=:n\ and diet 
(2) Famtlisl ^gtegaticu tv\- n ■-tudie-j (4) and setrr^gdUon, tmahtis ( <>ug^i.t>l 
IhaMlicre IS al^JD a '^jgiiittvc nl jjpnttK t^onhihution ti the disease Iht o<3.tKlidiitt geiu 
appjoax'h whith Kiv.t)s{-5S on tcstmg biokigicaily tcievatit LJiididatcs has imphtatcd 
vanants m tlie ABCA4, FBIA% and APOE genes as nslc tactors tor AMD. 
Rephcatxon of Jihe ABCA4 and FBLN6 ffudisgs has be^ difiiaiit, and in toto these 
yariaiits expkm only a small pmporttoji of AMD {6-8), An alternative genomic 
approacli uses a combinatioBi of genetic liiikage and association to idesntify novel 
genes involved in AMD. We participated in a recent oollafooiBtive genome-wide 
linkage screen (0) in which ob-omasoaie iq32 was identified as a likely region for as 
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AMD risk gene, a location ais.> siippviie;] hy ■-.xhtx ^tiKiies Tins region 

oontciins be^A'eeji over .100 genes, {sec: Oa-iine Mendeliiiii bilK-jitance i« Maft at the 
NCBI website) and no pariicular gene was Kbmificd by this work. 

[94] A.ge-r-3iated macular degeueration (^VMD) ir> a oommojv complex disorder ihat aiTects 
tbe central region of tiie rcitina (mjicula) and is tlie leading cause of legal bliiidaeys m 
older Aateiican adults, 'Ihe prevalence of AMD and its significant morbidity will lise 
sharply as (he popalatioii ages. AMD is <i cliniciilly heterogeneous disorder with a 
poorly isndersfood etiology. Poptilation-based longitudinal studies (Klaves^ et al. 2001; 
van Leeuwen et al. 2003; Klein et al, 2003) have established that tibs presence of 
extracelMar proteio/Upid deposits {dnisen) between the basal lamim of the retinal 
pigment epithelium (RPE) and the tuner layer of Bmclie* membraae is associated with 
m increased risk of progressiiig to m advaaced fon» of AMD, either geograpMc 
atrophy or exudative disease. The presence of large and indistinct (soft) drusea 
coupled with KPE abnormalities is considered an early form of the dtsoidtar and is 
oftaa refm-ed to as age-rdated maculopathy (AKM), 

f05J Epidemiologicaily, AMD is a complex disorder v-ith contributions of environmentai 
fectors as well as genetic susceptibility (Kiein et al. 2004). Mjtny eoviroitimental aad 
liiestylo feotors have been postulated, te by fer the most consistently implicated nonr 
genetic risk factor for AMD is cigarette smokiag (Smith et al. 2001). Much progress 
has recently been made in ideatrfymg and characterizing the genetic basis of AMD. In 
a remarkabk example of the convergence of methods for disease geaae discovety, 
multiple independent research efforts identified fee Y402H variant iu the complement 
factor H (CFH [{MM 1343701) gene on chromosome lq32 as &e first major AMD 
susceptibility' ailele (Haines et al. 2005; Hj^man et aJ. 200S; Klein st al. 2005; 
Edwards et at 2005; Zareparsi et al. 2005; Conley et al 2005). WhS& oas of the 
studies was able to pinpoint CFH on the basis of a whok-geaome as-sociatton study 
(Klein et ai. 2005), most studies focused on the lq32 region because it had 
oongistesQtly been iiBplieated by several \^iiole-genome linkage scans, A second 
genomic region with sirailariy consistent linkage evidence is cliromosojne 10q26, 
which was identified as the single most promising region by a recent meta-anaiysis of 
published linkage screens (Fisher et al 2005), 
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{06| Iw) recent sfikbes h'^\Z ^^^,^o■■r^ . " ID ■,C}'u5>l^u^ ^,''n&: l'>cat«t ua 

.4u< mj' »me 10q2'^ Ova ined a ^ombmdiioii of fTiHslv-b tG> J Mid crue t.(jntioi 
anal^seb lo uui'hcate the PLEMIAl ^cne ("pkckston hoiuology domxiiti (.ootaimng, 
family A (phosphoinositide biudiag specific) member 1 [MM 607772]) and the 
predicted JLOC387715 gsne (.Takobsdottir et al 2005). However, tlie assooiatioja 
sigaals for single-nucleotide polytuorphisms CSJ^s) in these two genes W£i« 
statistically iadistinguiabable. A second study using two ind^eadent case-control 
datasets concluded liiat the T allele of Sls^ rsl0490924 in LOC387715, a coding 
ohange (Ak69Ser) m exoa 1 of this poorly characterized geae, was the most likely 
AMD susceptibility allele (Rivssm et al. 2005), BoEh studies reported tiirnt tSie 
chromosome 10q26 variant eoiifera an AMD risk similar m magnitude to fhat of the 
Y402H variant in CFBL Here, we describe highly significant association of SNPs in 
LOC387715 with AMD. In our data, only SNPs in. Siis gene, including rsl0490924, 
explain the stiiong linkage and association signal in this region. Given a previous 
leport of an effect of cigarette smoking on the linkage evidence in the i0q26 region 
(Weeks et al. 2004; 9), we tested whether smoking modified this association. 

[07] There is a continning need in the art to identify individual geaaes that are involved in 
tte pathogenesis of AMD and'or to identify particular alleles that are iavoived in the 
pathogenesis of AMD, as well as to identify the interaction of the genes with 
modifiable behaviors, 

SUMMARY OF THE INVENTION 

108] According to one embodiment of the invention a meOiod is provided for assessing 
ittci^ssed risk of Age Related Macular Degeneration. The identity is detenained of at 
least one nuoieotide nssidue of Complement Factor H codmg sequiaiee of a persoa 
The nucleotids residue is ideatitied as normal or variai:rt by comparing it to a normal 
sequence of Complement .Factor H coding sequence as shown in SEQ n> NO: 1. A 
person with a variant sequmce has a higher risk of Age Related Macular Degeneration 
than a peison Vidth a normal sequence. 
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[m] Accof!:}i.ng to auothsr embodiment c; i-xsr.hcti piir. id^d j'oi ;;vi.vs-:ng iu.;rea.sed risk of 
Age Related Macular Degeneration. The ideiitity is dctertniuecl of -v, lenst. one amine 
acid residue of Compkmeiit Factor H pi-otein of a person. TIk- cc^;)d\ie is klemified as 
iioimal or variant by comparing it io a nommi sequeiice of Couipleiueni factor H as 
shown ill SEQ ID NO: 2, A pefson with a variant sequence has a higiver risk of A%q 
Related Macular Degeaeratlon than a person with a noimal sequeace, 

1101 Another eaibodimeat of the inveatiott pmvides a metliod far screening for a potentM 
Amg for Seating Age Related Macaiar Degeneration. A Complement Factor H 
protem is coatacted with a test agoat in the presence of a polj^oa. Binding of the 
polyaaioa to Complement Factor H is. measured A test agesrt is ideiitifed as a 
potenUai drug for treating Age Related Macular Degenatsttioa if it mcreases binding of 
Complement Factor H to the polymuon. 

[lij Aaother embodiment of the invention is a method for soreeaing for a potential drag 
for ti^sating Age Related IVfacular Degeneration. A Complement Factor H ptoteia is 
contacted with a test agent in the presence of C-Reactive Protein. C-Reaotive Protein 
bindiug to Complement Factor H is measured. A test agent is identified as a potential 
dnig for treating Age Related Macnlar Deg^eration if it increases binding of 
CompleramtFactor H to C-Rsaictive Protein- 

[12] A &ither embodiment of the invention is a method to ass^s risk of AMD in apatient. 
The presence of a T allele at rsl0490!?24 is determined in a patient Whether the 
patieat is a cigarette smoker is detenniaed. The patitsat is identified as bemg at high 
risk of A^^ID if the patient has the T allele and is a cigarette smoker. The patient is 
identified as being at lower risk of AMD if the patient has the T allele but is not a 
cigiirette smoker or is a cigarette smotaar but does not have the T allele. The patieat is 
identified as being at lowest risk if the patient does not have the T allele and is not a 
cigarette smoker. 

[id] Yet aiio&er embodiment of the invention is a me<hod to assess risk and treat AMD in 
a patient. The presence of a T allele at isl0490924 is determined in a patieat 
Whether the patient is a cigarette smoker is detennined. If the patient has fee T allele 
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at 1-310490924 ami is a cigarette smoker, beha%fioial tlierapi' is provided to the patient 
to encourage smoking cessation, 

{14} Still aiiotlier embodiment of ths mveotion is a method to assess lisk and treat AMD m 
a patisnt. The preseoee of a T allele at rsl0490924 is determined in a patient, 
Whetlier &e patietrt ia a cigarette smoker is determined. If the patient has the T ailsis 
at rsi 0490924 and is a cigatette smoker, the patient is provided wiih smokeless 
nicotiae to encouxags smojcing cessation. 

BRIEF BESCRTFTION OF THE BRAWllNGS 

I1.5J Fig, 1. Haply view plot defining haplotxpe block staictiire of AMD associated region. 
The :{iij]ative phyiiicfil position of each SNF is given in the itpper diagram, and the 
pmrwise litikage diseqmiibrium (D') beiweea all SNPs is givejx below each SNP 
combination. Dark red dmded squares indicated D' values >0.80. D~1.0 whea bo 
nnmber is given. 



11.6J Fig, 2. Plot of family-based and case^x^n^rQl P values for all SNPs within the AMD- 
assooiated h^lotype. The geaomic region smarming each gene is indicatecl in greaa. - 
iogso of tiio nominal P values are plotted for each SNP. Results for both the faniily- 
based and oasc-coiitrol data sets converge withia the CFII gene. 

[17} Fig. .3. Resijits of iiakage (lefi: axis: two-point and mtiltipomt iod scores) and 
association analysis (right axis; iogjo-transfonned p-vaJues from logistic ic^ssioa of 
case-control dataset, using additive coding described in text and adjtisted for age and 
aex). For exact p-vakes in 122-127 Mb region &at are smaller thm 10'^ see Table 5. 

{18} Fig. 4. ID pattern in region from PLEKHAI IMM 607772J to CUZDl [HONC 
179373. The relative physical position of eadi SNP is given in the upper diagram, and 
the pairwise X>* between all SNPs is given below each SNP combination. Red-shaded 
squares indicate D' values >0,80. D*=LO when no number is given, which is either 
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sigm&aat (datk-md shading) or noti-sigmficant {bkie sMding) based an the 
Haploview defeult de&iitioa (Gabriel et al, 2002) 

{19} Fig. 5 A. genotype fisqusiiciss at rs 10490924 m nm-ekted AMD patients, by pack- 
years of cigarette smoking. Fig. 5B, genotype fieqneiicies a* rs 10490924 in tmreJated 
oontrois without AMD, by pack-years of cigarette smoking 

|20j Fig. 6. Ordeissd subset aiiai>'Sis of 90 multiplex AMD fanuHea with information on 
pack-years of cigarette smoking. Dtished line: Multipoiat LOD* in 90 families. Solid 
iioe: Multipoint LOD* in 40 families with ^44 pack-years, avesrag^d across tmily 
members affected \vith. AMD. 

pi] Fig. 7: Table 4. Demographic and clicical characteristics of study population 

|22| Fig. 8: Table 5. SNPs in 122-127 Mb regioo witb p<0.(X)5 in casc-contro! association 
analysis. MAF: minor alleie txeqaency. Odds mtios (OR) adjusted for age and sex, 
estimated separately Ibr listerozygous (bet) and hosnozygous (het) canters of minor 
allele, ^-vskae j&om additive coding of SNP covaiiate described in text OlST: 
Genotype-IBD sbariag test (Li et al. 2004). 

[23| Fig. 9: Table 6. T^vo-locus genotype frequencies (%) and odds ratios for rsl 0490924 
inLOC387715 and Y402H in CFH, AH odds mtios adjusted for age arid sex. 

{24] Fig, 1 0 : Table 7. Results of fitting two-fector models by iogistio regression, adjtjsted 
for age and sex. Factor 1 is rsl0490924, model definitions in text. Afcaike's 
ijrfbrmation criterioa (AlC) difference is difference of the AXC -torn the best-fitting 

model. 

|2Si Fig.l 1 Table 8. Joint frequencies (%) and odds ratios for rei0490924 in LOC38771 5 
ffiid smoking Mstory (ever vs. never). All odds ratios adjusted for age and sex. 
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{26} Fig. 12: Table 9. Minor aiieie freqeencj' (Mi^F) and genot^'pe frequencies (iiumbar of 
individuak) at rs 10490924 by AMD grade. Data for stnokeis md aoii-smoters 
Qstimatsd fioni dataaet xi&ed for logistic regression modeling (Table 8). Data tor all 
geiioti'ped iiidividuais estimafect hy combining famiij'-based asid case-coiitroi dataset, 
including related individuals. 

(271 Fig. 13; Supplemeatai Table 1, SNPs idestttified ia LOCSSTyiS seqiisnoiug of 
individuals homozygous for rs 10490924 variant 



1.28] Fig, 14; Supplemema! Table 2. SNPs identified in CUZD'l sequencing of individuals 
homozj'gous for rsi89i 110 variant 



[29] Fig. 15: Supptenietxtel Table 3. Casts-coattul associalioii results for all SNPs i» 112- 
132 Mb region. 

l>ET.irLED BESCMPTION OF THE INVENTION . 

f30] The inveaitors Iiave developed methods for assessiag risk of developing Age-ReMe4 
Macular Degetieratioa (AMD) in affected ftaiilies and in individuals not kno^TO to be 
in affected femilies. Aitbongh deveiopiag the disease is a multi-factorial process, 
presence of a polyoiorpMsni in flie CFH gene (or con^Iement factor H protein.) 
indicates a greatly ■ increased risk (approximately double). Interestiagiy, one 
polytnorphisra is so pj^valent in the Caucasian popijlation that 1/3 of individaais catty 
at one copy of Uiat form. Moreover, identification of the CFH gene as involved 
in AMD pathogenesis permits the use of ike CFH protein in drug screening assays. In 
addition; we have identified a coding change (AlaSSSsr) in the LOC3S7715 gene as a 
second major susceptibility allele for AMD. The ovemll effect of fbe gene on risk is 
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drives by a Mglily sjgaificatii statistical interaction between ilie .L0C3 87715 variant 
and c^aiette sjnoking. 

The Y402H polymorphism (ejiooded by the T1277C polsmoipMsm) is located in tiie 
domaia kaowa as SCR7. See Table 3, SCR7 is kuowii to contain binding sites for 
both C-Reactive Protein (CRP) and polyanions, stieh as iieparin and sialic acid. The 
location of this highly informative polymorphism suggests tot not only is the CFH 
protein involved in the pathogenesis of AMD, but that the ability to bind one or botii 
of C~Reactive protein and polyanions is also involved. Variations in other domams of 
CFH may also relate to pathogenesis of AMD, including variations in domains that 
are involved in binding of complerasait factor C3b. Such variations may have an 
effect alone or in oonjonetion with the Y402H variant, 

{321 Any change in tiie CFH gene or encoded piotehi can be detemiined by comparing to 
the sequences of Qie major allele ixi the Caucasian population as shown in SEQ ID 
NO: 1 and 3, fornaoleoade and protein, respectively. Methods of ds^ecting sequence 
differences between a test sttbject's CFH and tlie major allele or major protein can be 
any method fcaown in the art. These inciode side-bynside comparisons of physico- 
chemical properties of proteins, immmological assaj^, primsr extension methods, 
hybridiisation methods, tiucleotide sequencing, amino acid sequencing, hybridization, 
amplification., PGR, oligomideotide mianatcb ligation asisays, primer extension 
assays, h&teroduplex analysis, aliele-specific amplification, ailele-specifio primer 
extension, SCCP, 1X5GE, TGCE, mass specti-oscopy, high pressure liquid 
chromatography, and comb inations of these techniques. 

133] Binding assays between Corapiement Factor H and either polyanions or C-Reactive 
Protein (CRI^) can be performed using my format known in the art. Binding can be 
measm^ in solution or on a solid support. One of tiie partners may, for example, be 
labeled with a radiolabd or iluorescent label. Partners can be identified losing first 
antibodies vvbich are either themselves labeled or measmed using second antibodies 
which are labeled and reactive with the fast antibodies. Assay formate can be 
competitive or non-competitive. 
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{34| Test agents can be natorai products or sp^iltetio, purifisd or mixtojres. They can be the 
products of combiaatorial chemistry or individual products or femilies of products 
which are selected, on the basis of sfcractmal informatioii. Test agents are identified as 
candidates for treating AMD if tb&y increase the biiidiag of oomplenient factor H to 
any of its physiological binding partners, including but not limited to C3b, sialic acid, 
heparin, and CRP. 

|35J The T allele is the variant of rsl 0490924 ihat has a T fit nucleotide 26 as shown in 
SEQ ID NO: 9, Other variant alielos as skovm in SEQ ID MO: 7-56 can be detected 
and used to assess risk of AMD. The other variants may be used mdependently or 
may be tised in coiyimction witti an assessment of smoker states. Current smokers are 
individuals who smoke at least once per week. However, historical smoking in an 
individual's past can also modify their risk of AMD, . 

[36] BehsN'iorai therapies \vluch can be recommended for smoking cessation include but 
are not limited to counseling, classes, printed information, electronic information, 
video or audio tapes. Providing a bdmviorai thempy may involve merely 
recommending it to a patient, prescribing it, or actually delivering the therapy. 
Smokeless nicotine is also a possible means for weaning persons from a smoking 
habit Smokeless nicotine, like behavioral therapies, may or may not require a 
physician's prescription- Smokeless Ibims of nicotine that can be used for smoking 
cessation or abatemmt include but are not limited to nicotine gums, transdemial 
patches, nasal sprays, and inhalers. 

137] Because the data indicate that the vanaiit of CFH and the variant of LOC387715 are 
independent predictive factors, they can both be assessed in the same person. 
Together, these two types of variants aie believed to account for Ihe maiorily of cases 
of AMD. Additional factors as discovered can also be tested, as they become 
available to the art, 

[38| Using iiemtive high-<!ensity SKP iissooiation mapping, we have identified a coduig 
change in the LOC387715 gene^ at SNP rsl0490924s as the most likely second major 
AMD sasceptibilify allele. We also generated statistical evidence of gene- 
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enyiroisment interaction for this variant, suggesting &at a genetic siisceptiMity 
coupled widi a modifiable lifestyle factor such as cigarette smoking confers a 
significantly hi^er risk of AMD thaa either factor alone. Genotype jfrequetioiss at 
rs 1 0490924 were strongiy con-elated with pack-years of smoking in AMD patietits, 
consistent with heterogeneity analysis of the gsaietic linkage data. It is striking that we 
liave obsen^ed evidence for gene-esvironmeat interaction in two different datasets 
Bsing two statistically independent approaches . However, the presence of statistical 
interaction does not prove biological interaction, and much work remains to be done 
to idafitify the molecular mechanism onderlyiii^ the inta-eased AMD risk. 

[39] Our data did not support the previously r^orted association of AMD with ilte 
GRK5/ROS10 region at -121 Mb (Jakobsdotlir et al. 2005) since the fow SNPs 
(hcvl809962, i-s871l96, rsl53757<>, rsl467813) that we genotyped in this region did 
not demonstrate significant association {p>0.05). The GIST and conditional hapkitype 
analyses suggested that only isl 0490924, and sitnwniding SNPs m LOC387715 in 
high UD with it, explained tSie linkage and association signals in this region. See other 
SNPs in LOC387715 at SBQ ID NO: 7-5<>. Neither analysis supported SNFs in the 
nearby PLEKHAl and PRSSil genes as being responsible for either the linkage or 
association evidence. Consistent with these results, the most significant single-SNP 
associations, the highest odds ratios, and the Mghest nonparametdc two-point lod 
score of 3.2 were contributed by SNPs in {he LOC3$77i5 gexie. Whiie we did not re^ 
seqtience the nearby PLEKHAl and PRSSl I genes, we genotj'ped the vast majority/ of 
SNPs exitmined by the earlier studies in our dataset. Several SNPs in the CUZD! 
gene, which is not in LD with tlie PLEKHA1/LOC387715 LD block, gave substantial 
association signals with logistic regression (smallest p-value: 0,0002), bnt allele 
frequency differences in cases and conJxols were much less piononnced for these 
SNPs (MAF,.^,, -55%, MM^^r^i, -4^%), compared to SNPs in LOC3S771.5 
(MAFc^~41%, MAFcoaifote ~26%). In addition, the GIST method and the conditional 
hapiotype analysis suggested that tiliese SKPs did not explain the linkage and 
association signals in this region. 



[46| The limitations of any retrospeotive epidemiologic study apply to our fiadiags, 
includmg the potential fox recall bi^ of past esxposares. The valich^ of the sanunary 
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PAR% estimates depends on the extent to vs-iiich oar case-control dataset is 
representative of a population-based sample of AMD patients md contiols. Since ow 
dataset was used to identify tbe LOC387715 susceptibility vadaitt, it is possible Jliat 
Its eftfect size, and hence its PAR%, was overestimated {LolHxaieller et al. 2003; 
loaiimdis et al, 2001). ladependeat popalation-based studies of large sample sixe. 
ideally coOected m a prospective feshion, fire needed to coiiiirra the statistical 
interaction between smoking aad rsl0490924 in contributing to AMD and its cslinical 
subtypes, am! to rejSae estimates of their individual and joint PAR%. 

[41] There is cuirentiy no biological explatiatton for the mechanism by which LOC387715 
may jacrease the risk of AMD. It is not clear whether this statistioal association 
provides furttier mpport to the lole of the innitte imimuiity system tliat was 

higllUghted by the recent disGoverj' of the CFH gene. LOC3S771 5 is a two-exoii gene 
lliat encodes a protein of 107 amino acids, whose only homologue is a chimpanzee 
geae of 97% protein identir>'. No significant matches were fbimd with my iaiowa 
protein motifs. ESTs have been recovered iiom the placenta and the testis, and this 
gene to recently been reported to be weatdy expressed in the retina (Rivera et al. 
2005). 

[42] In sutamar>', we have replicated and refcied previous reports ijnpHcating a coding 
change in LOC38771S as the second major AMD susceptibility allele. The effect of 
rsl 0490924 appears to be completely independent of the Y402H variant in the CFH 
gene. The joint effect of these two susceptibility genes is consisteot with a 
muitipUcative model, and togetlier, they may explain as mo.ch as 65% of die PAR of 
. AMD.' Previous data by our group suggested that the joint etfects of CFH and 
smokmg are also consistent wi& a multiplicative model {Scott et ai. 2005), la 
confeast, the effect of ssi0490924 appears to be strongly modified by cigarette 
smoking. Smoking and LOC387715 togeflier may explam as much as 34% of AMD. 
WMe the marginal effect of rs 10490924 was strong enough to be detected without 
incorporating smoking history information, an effect modification of a genetic 
susceptibility by a Kfestjde factor like smoldag lias important implications for the 
cMcal interpretation of this jgndmg. Our data suggest liiat the T allele at rsl0490924 
may only moderately increase the AMD risk in non-smokers and likely exerts its 
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stroagest effect on heavy smokers. TMs has the potsaiHal b> reduce the impact of m. 
AMD susceptibility allele on the aging populatioii by public healiSi efforts, such as 
smoking prevention and smoking cesaatioo progreims. Our replicatioa of the 10q26 
linkage heterogeneity due to smoldaig, and the consistency of results from multiple 
statistically independent approaches for assessing gene-environmeiit interactioa 
reported here, are unusual in genetic studies of complex human diseases and provide 
substantial support to our findings. 

f43| We nse4 iterative as^ciation mapping to itientify a si^eptibility gene for age-related 
macular degeneration. (AMD) on claomosome 10q26, which is one of die most 
consistently implicated linkage regions for tihis disortler. We employed linkage 
analysis methods, followed by family-based and oase-oontrol associattoa analysis 
using two independent datasets. To identify statistically the mo.st likely AMD 
susceptibility silJciCj we used the Genotype-IBD Sharing Test (GIST) and conditional 
haplotj-pe anaiysis. To incorporate tlie hvo most impDrtmit known AMD risk factors, 
smoking md the Y402H vtiriajit of tiic compiersient fector H (C.FH) gene, we used 
logistic regression modeling to test for geiie-gesie and gene-cnv:iromne.nt inicraction in 
the case-control dataset, asd she ordered subset analv^jis (OSA) to accoimt for genetic 
linkage heterogeneity' in die iBiniiy-based dataset. Our results strongly implicate a 
coding change {Aia69Ser) in the LOC3877IS gene as &e second major AMD 
siisceptibiiity allele, confirming earlier suggestions. Its effect on AhiD is statistically 
independent of CFH and of similar magnitude to Y402H. The overall effect is driven 
prinrarily by a strong association in smokers, as we obsciTcd significant e^ddence for a 
statistical interaction of the LOC3S77I 5 variant with a historj' of cigarette smoking. 
This gene-environment inteniction is supported by statistjcally independent femily- 
based and case-control analysis mefcods. We estimate that LOC2S771 5 and smoking 
togeth^ explain 34% of the population-attributable risk (PAR) of AMD, fvst&m, we 
estimate Oiat LOC387715 and CFH together account for 65% of the PAR. of AMD. 
For die ilrst time, we demonstrate Uiat a genetic st^ceptibility cotipled mth a 
modifiable lifestyle factor such as cigarette smoking confers a significantly higher rids 
of AMD than either fector alone. 
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{441 TJis above disclosure geoerally describes the present inventioii. All rsfeftmces 
disclosed herein me expressly incotporatsd by refereHce. A aiore complete 
tmdeManding cm be obtamsd by reference to the fbllowittg speciSo examples which 
are provided herein for purposes of illastration only, and are not intended to limit the 
scq[)e of the inventioa. 

EXAMPLE 1 

{451 To idealify the responsible geae on cljromosome iq32, we initially genotyped 44 
SNPs (12) across the 24 megabases (Mb) incorporatijig this Hnltage regioa. Ws 
examined two independent data sets: the first contained 182 femilies (111 mxiltiplex 
and 71 discordant sibpairs) and the second contained 495 AMD cases aiid 183 
controls. Each S>BP was tested for association independeMy in both data sets. Two 
SNPs (182019724 and rs6428379) in moderate linkage diseqniHlaiiiRi wife each other 
(r'^O.eX) generated highly sigaificaiit associatiom with AMD in both the fejnily- 
based data set (is2019724, F==0.0001; rs6428379, iM).0OO7) and iti the case-coniiol 
data set (rs201 9724, PO.OOOl; rs6428379, P<0.000 1), These SNPs lie approximately 
263 kilobases (Kb) apait 

EXAMPLE 2 

1465 To dejEise Uie extent of linkage diseqijilibrium completely, an additional 17 SHPs 
were gcsnotyped ats-oss approsdmateiy 655 Kb flanked by rgl538687 aad i3l537319 
and ena>mpassing the 263 Kb region. Two linkage disequilibrinni blodss of 11 Kb 
and 74 Kb were identified and wetre separated by 176 Kb (Fig. 1). The 11 Kb block 
contained is20i9724 and the 74 Kb blodt contained rs6428379. Association analysis 
of liie 17 SNPs idcsntified multiple additional SNPs giving highly significant 
associations in one or both of the family-based and c^e-control data sets (Fig, 2). In 
the mc-coiitro1 data set, a frv^e SNP h^lotype (GAGGT, defined by SNPs rsl 831281, 
rs3753395, rsl 853S83, isl0494745, and rs6428279, respectively) coiwprised 46% of 
the case and 33% of the control chromosomes (P==O.0GO3). This same hapioiype was 
also significantly ovef-hansmitted to aSected individuals in tlie family-based data set 
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(7^0,00003). The coavergence of the most signifjcant associations to (ids same 
hapiotype in the two mdqjendetit data sets strongly saggests tliaJ: &is region contains a 
commonly inherited variaat in m AMD risk gene. 

[471 l^e associated GAGGT haplotjpe spans apiHXJximatdy 261 Kh. It contains Oie 
Complement Factor H gene {CFH, OMM #: 134370, Accession #:NM„000186) at«:! 
the five Factor H-ielated genes CFHLi-5, md lies wifbiii the Regulator of 
Complement Activation (RCA) gens cluster. The most consistent association results 
(Fig. 2) from both the &i»ily-based aad case-control data sete converge wiftin the 
CFH gene implicating CFH as the AMD susc^tibility gene. The biological role of 
Complement VaaiXiS H as a component of the iomte imtnime system that modulate* 
infkiHination through regulation of oomplensent (reviewed in {13)) enhances its 
atlxactiveness as a candidate AMD susceptibiiity gene, inflsnnmation lias been 
repeatedly hnpHcated in. AMD pathology. C-reactive protein levels are elevated in 
advanced disease {14), MXti-retioal autoantibodies have been detected in AMD patients 
(i5), macrophages are localised near neovaacnlar lesions (IS), and fee halhnark 
dmsen deposits contain many cjomplement-related proteins (1 7). 

EXAJViPLE 3 

|48| We screened for potential risk-associated sequence variants in the coding region of 
CFH by sequencing 24 cases with severe neovaseular disease and 24 controls witli no 
evidence of AMD. To maximize fee likelihood of ideating toe risk-associated 
allele, all secjuenced cases and controls were homozygoi^ for the GAGGT haplotype. 
Five novel and six known secjueace vmaats were detected (Table 1). Only one 
variant (isl061170, sequence: T1277C, protein: Y402H) was present significaatly 
more often in eases than contwis, occurring on 45/48 hapiot>pes in the cases and cm. 
22/4S bapiotypes in the coatols (PO.OOOl). The frequency of sequence variants 
within the CFH codiiig region on the associated haplotvpe was significantly reduced 
in cases compared to cotitrols (12% vs. 18%, P=0.002}. When the over-represeated 
T1277C valiant i-emoved firom (he analysis, this dit&rence became mors 
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pronomiced (3% vs. 16%, P<0.00001)- Thus T1277C is &e mmaxy DNA secpeace 
variant differeatialing beiweett fte case and coatrol hapiotj'pes. 



wo 2«W0<M56t 



Table 1. CFi? sequence variants idsntifiyd in neovascuiar AMD cases and uonuai coiitrois. 
All mdividuftls were homozygous for fhe AMD-associated GAGGT haplotyps. Ihc 24 
fiffecled iiiciividuals selected for seqiiencing had severe neovascuiar diseaso (gnKic 5) {12) 
wiiU diagiiosis before age 74 {moim age at diagaosis: 65.8 jts). The 24 control iadividuEiis 
scioctod ibr sccjuencmg had ho evidence of AMD (grade 1) with ijge at exani after age 64 
(mean a[?,e at. exrini; 69. S yrs). The six pi cn-ioxislj- idtMnifted SNPs are labeled using standard 
nomeaciatiue. The five riove5 variants; ace libekd giver? ibdi- base pair location on 
chromosome 1, Ensembl build 3S. Five SInPs .:xe£if:e ;x;n-f>-.nc;ri:,^ious amino acid changes 
witkiii CF.M' and .five SNPs create syiniiiyniDus clutng;-,-. lixon t is not translated. 
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[49] We scieeaed ibr poteutM risk-associated sequmce variaats m the coding rogton of 
CFH by sequendog 24 cases wi& severe neovascukr diseasse and 24 controls with so 
evidsaee of AMD. To maximize the HkeKhoad of identifyiog &e risk-associated 
allele, all sequeaced cases smd coatrols were homozygous for the GAGGT haplolype. 
Five novel and six ibiO'tvn seqaence variants were detected (Table 1). Only one 
variant (rs 106 11 70, seqaence: T1277C, pToteta; Y402II) was jweaeiit sigmficaatly 
more often iji cases flian controls, occiirnng on 43/48 liaplotj'pes in tiae cases and on 
22/48 Implotjpe-s in the co«t!ois (;P<O.OfX)l). The frequency of sequence variants 
within the CFH codsiig region on the associated haplotype was signiticastly reduced 
m. cases compared to controls (12% vs. 18%, P='0.002). When the ovcr-represontod 
T1277C variaBt m'SS retnoved from Oie analysis, this difference became more 
pmtiottuced (3% vs. 16%, /'<0.00001), 'Oms Tt277C is the primary DNA sequence 
vatiairt diiferentiating between tlie c^e and control haplotypes. 

EXAKIPLE 5 

{50} Complete genotyping of 112770 in tiie family-based aiitl case~co«.trol data sets 
revealed a signiJScaat over-traa^miss.ion m the faniihes (P-O Ol'*) {IT) and a highly 
-jtgi ifi;C<nn i.>^er lepte'^cntatKm 'u it * n'pvucd '« i-otiituh (P~0 00006) The 
odd icitiofor WfDv i'. " 4 ( "^''-^i-i 141 - '^^) for uuiias ot C dUele and ^i '^3 
CI 1 '"i'b 20) lor wmas, vi iwQ C alLle-^ W heo Ilk nktlj-sss %\ as le^tntted to 
ouh nLO\a uiUir \MD thest odds n ios inueiNcd to ^ 4^ (9^% CI 1 ''e-i-^^'^) .md 
17 (^^"o CI 2 ^^2-1 2 ''7) res?.t \ds flu 1= p ircnt dn e efttU tor 5 i\K is oaateti 
V ) th ihL C illt'k V. ^ highh igiutK a it {P <0 OOt^ 1 ) T hot ^ u ^s no ajipai ent lUchc ^T 
gen tApic, «lftct Ti2'"7C on a^e al AMD dhunosts {raeni a<fe d i^nosK TT 
76.Svis; TC 77.5yrs; CC 75.5 yrs). The population atlnUitable risk percent lor 
carrying at least one C allele was 43% (55% confidence intmal 23-68%). 
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[51] The y402H variant is prisdicted to have &iictionaJ conseqtiences eoitsistent witia 
AMD pathology. Residue 402 is located witiiio binding sites for heparin. (18) and C- 
reactive protein (CEP) {19). Binding to either of these parsers increases tlie aflStiity 
of CFH for the complesnesit prnteis C3b {20, 22\ augmenting its abilitj" to down- 
regulate eomplement's effect The obssaved co-iocaiization of CFH, CEP, and 
proteoglycans in the superficial laj'er of the arterial intima suggests that CFH may 
protect the host arterial wall fiom excess complemeait activation {21). We 
h>'pothe.size that allek-specifio changes 'm tiie activitieg of the faiadtag sites for 
heparin and CRP wovtB alter CFH's ability to sitppress compiement-relatsd damage to 
arterial walls, and might althnateiy lead to vessel injury aad sabseqnent 
iieovascuiar/exxKladve changes such as those seen in aeovascular AMD, 0«r data 
support fids hypothesis since the risk associated with the C tillele is more prortouinced 
when, the analyses are restricted to noovascujiar AMD. Given the known iunctioaal 
interactions of genes wifliin the E.CA geae cluster (23), variants witbm fiiese genes 
conld interact with or moiify the effect of the TI277C variant 

[52] Interestingly, plasma k-vch of CFli are ktiosva to decrease both with age and wM\ 
smoking {23), two known risk factors for AMD (2). This confluence of genetic and 
enviix)nmental risk tactors suggests an nitegrfitod el ioiogical model of AJVfD involving 
chronic inilammatjon. Identification of the increased risk of AMD associated with the 
T1277C variant should enhance om abiiir/ to develop pres>TOptomatic tests for AMD, 
possibly allowing earlier detection and better treatment of tliis debilitating disorder. 

BX.A.MPLE 6 (relates to examples 1-5) 
Participajits 

[53j We ascertained M/ID patients and feeir affected and tamffected family members 
through two climes in the Sonifaeast^a United States - Duke Univetsi^ Medical 
Center (DUMC) and Vanderhilt IMversilj^ Medical Center (VUMC). Um-elated 
controls of similar age and ethnic backgrotmd weitj enrolled vis (i) study 
advartisemeat in DUMC- and VUMC-amiiatetJ newsletteis; (ii) recraitmeat 
presentations by study coordinators at local retirement coimauaities, who were likely 
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to obtam beaiih sare at DDMC or VUMC, resisectively; {in) AMD-mM&d seaimars 
for the gm^ai pisblic sponsored by DUMC or VUMC opbthalmolog)' clinics, (iv) 
referrals from other clinics in tbe Duke aiid Vaaderbiit Eye Centers of i»dmdaals 
witboat evidence of ocular disease. Spotises of Al^lD patieats were also asked to 
participate as potential controls. Controls eligible for eoroilment were offiml a free 
compreheatisive eye exam iacluding fundus photogtapiiy to ensure tliat (he same 
melbodology was used to assign AMD pades aa for the AMD patients and tbeir 
lektives ascertained in oiimc. AH cases and controls included in this siady were 
Caucasian m& at least 55 yeara of age. Hie study protocol was approved by the 
respective liistitatioiial Review Boards (IRB) at DUMC and VXMC, and the iieseardi 
adiiered to the tenets of the Declaratioa of Helsinki, 

[54} Ttie fetniiy-based data set consisted of 111 multiplex families with at least two 
ii)di\dd«als with grade 3 or higher AMD in at least one eye. Seventy-three families 
had two affected individuals, 29 families had three afifected individuals, and nine 
families had ibiir or more greeted individuals. Unaffected spouses and siblings were 
ooUectcd whenever possible, 71 additional families consisted of one affected 
indi\'idusl and at least one unaftected sibling (discordant sibpairs). 

Climcal Assessmmt 

l$5] The assignment of AMD affection status was based on Uie clinical evaiuation of 
stereoscopic color fundus photographs of the macula (EAP, AA), according to a 5- 
^de system described previously (SI). Or<jde I has no AMD features, grade 2 has 
only small non-extensive dmsen, ^ade 3 has extensive intermediate and/or large 
(iiusen, grade 4 is geogr^hic atrophy, and grade 5 is neovascaiai" AJS'ID, This system 
h a sli^it moditlcation of the Age-Related Eye Disease Study (AREDS) grftdiag 
system iffld iises example slidss from the Wisconsin Grading Sj-stem (83) md the 
international Classiilcation System {S3) as guides. Aflijction statas was defined by 
the snost severe grade hi either eye. All questionnaire data and samples w«a^ collected 
after infbmied consent was obtained. 
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MoJecHiar Aiialyses 

<i ti ijiiu ^>^*- 1 Uoi nu i f I V ^ I tic Dik ( IK) 0 ^ oiuiubilt 
CJfOP r>"^ \ 1 »ikni (Cits I smp the Puicuonc > stem {<j(rijd ^jsutin 
MiBic-ipolis OT! ^Ti A.uto)fUte I S Oenoi^Tiiig \\ !5> pcitoi 'icii i3^in» iiiiman on 
(he ABl ?mm ^-^OOHi, rod Analmdwtli thv SDS ^o^t^\<w'e S\P \o-a>>(.)n 
Detnand or Assays-By-Desigii were obtained fix>m Applied Biosystems iicoiporated 
(Foster City, CA). The mitj&l set of 44 SNPs was chosen to approximate a 500 Kb 
spacing between oiiiikeris. 

Exons of CFH were PCR ampMed fiom genomic DMA, sequenced ashsg Big Dye 
t3,1 (ABi) on an. ABl 3730 automated sequsacer, and analyzed usmg Mutation 
Stin^eyor softvvttre (So%eaedcs, State College, FA), T1277C falls within a geaomlo 
duplioation and could not be gejiotyped using TaqMaii assays. Ail individuals were 
sequenced using primeis CtGTTTCTTCITGAAAATCACAGG (SEQ ID NO: 5) and 
CCATTGGTAAAACAAGGTOACA (SEQ iD NO: 6) to determine T1277C 
genotypes. 

Statistical Analyses 

Linkage diseqnUibtium and Hardy- Weinbtjrg equilibrhmi calculaiioas were done 
asiag Hapioview version 3,0 lisitig all case and control samples md one random 
individual from each of liie families (34). Haplotype Mocfes were defjnsrf using the D' 
parameter and fhe defeult defmitions within Hapioview. Allele frequenoy diffensnees 
wci'e tested using a test. 

Siugle-loctB and haplotype family-based association was tested using the Association 
in fee Presence of Linkage (AFL) method (SS) that performs a correct TDT-style test 
of association in the presence of linkage, usittg nuclear families witii at least one 
afiected individual and any manber of unafiected siblings or parents. Odds ratios 
were calculated using standard logistic tegressicai models (SAS version 9.1, SAB 
Institute, Cary, NC), The outcome variable was AMD affection status and getiotypos 
were coded according to a log»additive model Dose-response was tested using the 
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.test tor tread, Haplotype analysis in Qis case-control data set was tested usiag tlia 
"haplo.stats" ptograra that tise-s a Hkelilioocl-faased jnetliod to estimate haplotype 
frequencies {S6). 

im\ The 95% coflfidenoe interval for the population attributable risk percent CPAR%) for 
Ti277C was calculated oa tiie point estimate of the ?AR% (43%), which was 
caicniated ixom the combined fesqueaey of genot>pes CT and CX in controls and the 
unadjusted odds mtto (OR) of AMD for these genotypes relative to tiie XT reference 
group (S7). Calculation of the PAR% from case-control data assumes that the controls 
are representative of Ihe geneml population and ths disease is tare (< 5% population 
prevalence across ail exposure levels). PAR% calcufetsd fiom OR adjusted tor age 
and sex was similar. 

16ii We note that fee P-value of the T1277C association in tlie family-based data set is not 
as significant as the iP-vaiue for the two original SNPs. This resiUts from the 
asoei taintnent Has toward se\'ere disease in ttie femily coHection, wliich results in an 
oveisampliug of T1277C-CC homoaygotes. Family-based testa of association depend 
on both transmission and association. Oversampling for homozygosity reduces tlie 
power of any feniily-based transmission disequilibrium test Since the original SNPs 
have lo%v linkage disequiiihriam vahiea with T1277C {r^-0.00 mid 0.14 for rs2019724 
and rd6428379, respectively), diey were not ov«sr-sampled for homoaygosity to ihe 
extent of T1277C. to the case-control data set whsaie Qte sampling bias is not as 
proft>uttd, the P-values fi>r all three SNPs are simdatly highly significant. 

Haplotype Aaalj'sis 

\m The five SNP hapiotj-pe block, defined fay SNTs rsl 83US1 , i-s3753395, rsi853S83, 
r^l0494745, and rs6428279, identified five common haplotype.'} that capture over 95% 
of the haplotype variation (Table 2). The OAQOT haplotype is the most conanon in 
both the cases and controls, but is significantly more firequeat in the eases. 

Table 2. Ihe haplotypes and their fiequencies caictdateci irom the case-control data. The 
Implotype consists of SNPs rsl831281, rs3733395, rsI85S883, 1^10494745, 8ndis6428279, 
respectively. 
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'labie 3, Location of SCR domains in protein. 
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EXA^IPLE 7 

Linkage and Assoctatios Analysis 

[63] Resequeaicing of tihfi LOC3S771.5 md CIJZDJ genes identified 21 kaowa md 23 
novel SN.Ps (Sttpplemental Tables 1 and 2). Sequeacic^ primers md coaditioiis are 
available frojii flie authors (MAH) upon request. Of tiiese 44 SlSiPs, 19 were 
genotj-ped in our entire dateset. Geaotypes for all SNPs aaalyzad here were in Hardy- 
Weinberg equiMbriBiu in vtarelatsd ooritmls (p>0,01). We observed high LD CD'>0.9) 
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across a 60 kb region iacittding a ftequent coding SKP m exoB 12 of" FLEKIIAl 
(rsI0452l6), jteee coclmg SNPs m LOC387715 (rsl0490923, r327369U, 
rsl 0490924) md several addjiion»l non-coding PLEICHAl aiid L0C3 87715 SNPs, 
replicatiag earlier observations (Hivera et al. 2005). Notably, the adjacent dowKstreaia 
gsae PRSSi I (HtrA serine peptidase 1 (HTRAl), [MM 602194]) was not metaded in 
this 60 kb region (figure 2), 

1^41 la fee family-based linkage analysis, a peak multipoint lod score obtained at 
124,7 Mb (HLOD 3.0 under affeoteds-only donunant model, nonparametric LOD* 
2.6, figure 1), SNP rsl0664316 in LOC3S77i5 (124^ Mb) gave a maKimam 
nofiparametric two-point lod score of 3-2. In tUe case-control analysis, ibar highly 
correlated SNPs in the LOC3S7715 gene, including the fteqti^t coding change 
ra 10490924 in exon i previoysiy implicated (Rivera et al. 2005), woxs very strongly 
associated witti AMD, with iogistic fegreasiDa p-vahies on the order of 10'* (table 5). 
Ilie minor allele ftequency {,MA,F) of thcss liigbly correlatexi SISiPs was -41.7% in 
oases, very similar to that reported by Rivera ei al., and --25.8% in controls, somewhat 
liigherthan the 19,6% reported by Rivera et: al. Within die 60 kb LD b.locic and m the 
m&m 122-127 Mb region, <issocif.tion signals of tliis order of magaiasde were 
obseived only for this set of higWy coirelated SNPs. In particular, the coding SNP in 
exon 12 of .P.LEKHAI (iral045216) showed substantially weaker evidence for 
associatioE, bodi in terms of magnitside (odds rado, OR) and statistical significance 
(MAFfisssa: 28.2%, 1S'lf\F«o,-,trob: 36.8%., OR-0.6, p^0.02). Unlilce the previous reports, 
we detected a second region of association 400 kb distal to LOC387715 fliat included 
sevsiBi SlSiPs in fee CUZDl gene ajid an evsai more distal SNP in the .FAM24A gene 
(feimly wilii sequence similariiy 24, member A [HGNC: 23470]). ITiese SNPs, which 
were in L0 with each other but not in ID with the associated SNPs m IX)C3877i5 
(figure 2), showed independent evidence for association with AMD risk, although at 
much, lower staiisficai significance {MM^: -55%, MMc^: ~48%, p»0.0002- 
0.0058). 



EXAMPLE S 

GIST Analysis 
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[65\ All SNPs with p-values <0.005 in &e ca$s~oonta-)i analysis were analyzed with GIST 
to test if tliey explained the linkage signal in tfes region. Under the sukiitive weigirtijjg 
scheme suggested by the case-control analysis (Li et ai. 2004), ooly the four SNPs m 
the LOC387715 gene were significaat in &e GIST aaalysis (table 5). Tlii? suggests 
&at the LOC38771S gene alone is responsible for the 10q26 linkage evidence. 

EXAMPLE 9 

Conditiojaal Haplotype Analysis 

{661 With the combined case-control dataset, we used eonditiotial liaplotj'pe modeling to 
identify the dumaticaliy mcsi iikely AMD susceiJtibtlity vsdmt from among alt the 
SNPs with sti:ong evidence for associaiion. We lested each SKP in faible 5, 
conditioning on (he risk allele of xh-s most siioagly associated SNI* in CUZDl, 
FAM24A and I...OC?^77L^. Cotniitioning on the ride fUkle at rslSPlUO in CUZDi, 
rsl0490924 w;is sjin.Migiy a.sscK-iatod (f:-^7.6B-ii5) while none of the other SNPs were 
.signiilcant (p>0.05). CuuditunuiU! ou th.« tisk M i-^229W^S in FAM24A, 

isK)4')0924 wsfs sUc)ii;.dy ;isw.H:.i;!ttx] (j^ - 7-1F.--05.) vviiit;-. none of the ofher SNPs were 
sigoiljcuni i'jM),0.5). Only conditioning on the rhk allele at r,siO4<>0924 fully 
expkiuieii the association signal in the region, such that none of the otlier SNPs 
showed any evidertce for association (p>0.6). Thn.s, tlii.s finai>'>it.'i also strongly 
implicates tlie 1J3C387715 gene alone in A.MD, consistent -wiih (Jie Riv^a et al, 
study, 

Gene-Gene Interaction analysis 

[67] We estimated joint odds ratios for all genotjpe combinations of the Y402H variant in 
CFH and the i-sl0490924 variant in LOC387715 (table 6). The TT/GG combination 
was used as the referent group. IPot individnals with the TT genotype at Y402H, the 
GT genot5;pe at jsl0490924 conferred a 2.7-fold increase in AMD risk (p«0.02) and 
the XT genotype confesred a 13.1-foM incr^e (p=0.003). For indi^'iduals with the 
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CC genotj-pe at Y402H, which conferred a 4-foW iacreasa m A.M0 risk for TT 
genotypes at rsl0490924 (p-^.0007), the GT geaotype conferred a 12,6-Md iiicfease 
in AMD risk (p<0.0001) md tlie TT gciiQt>pe conferred a 23.S-fold more^e 
(p<0.0001). Consistent wi& ifsmK& of the AlC modelmg strategy (table 7), the joint 
action of the y402H and the rsiO490924 variants was therefoi-e best described hy 
independent mullipKcative effects, without statistically significant evid^ce for 
dominaftce effects or epistatic iatBiaction. The joint efteot of Y402H and rs 10490924 
accounted for 65,1% of the population attributable risk (jPAR) of AKID (BrtEzzi et al. 
1.9S5). 

EXAMPLE n • 

Case-Control Geae-Eavir«»i»e»t Interaction Auaiysis 

[6S] In contrast, we ibmid strong evidence for statisticai interaction of smoking and 
genot5--pes at rsi0490924. Ilie model with the ADD„SMOKE_IOT temi provided a 
sigmXicantly better fit to the data hy 5,2 AlC iitiits, compared to the model without this 
terra {table 7), A aigKiiicant product temi with positive regression coefficient for 
smoking and rsl0490924 in die logistic regression mode! indicated more than 
multiplicative joiiit eftects (p-0.007). la our dataset, the presence of the LOC387715 
susceptibility allele did not center a significantly incTeasetl risk of AKiD to non- 
smokers (p-0.59 for GT genot>pe, p=0.12 for TT genotjpe, table 8), while the GT 
genotype in smokers increased the nsk 2.7 fokl (p=<>.00l) and the TT genotj-'pe in 
smokers increased the risk 8.2-foid {p<O.0O0i). A case-only analysis of rsl0490924 
and pack-years of smoking (aa a contin«o«s variable) also supported the presence of 
gene-envirormtent interaction (p«0.05 adjr^ted for age and sex). The relative 
ftequeaacy of TT" genotypes in affected individuais mcreased ahnost lineariy widi 
increasing pack-years of smoking, wito a coiresponding dea-ease of QQ gmotype 
fiequencies (fignre 3, panel A)." Tliis pattern was striMn^y similar to results for 
simiilated data when the disease status was generated vviih a logistic regression modd 
including a geQe-esi%'ijronmeat interaction term (Scljiaidt et al. 2005). Genotype 
frequencies at rs 10490924 were not related t» paok-yeaars of sanoking in ow oonttol 
sample (Fig. 5B), cojiirnning that the resdt in cases was due to gene-environmeat 
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mteraction ratJier taan popttiatioB corrdatioii of the two factors. The joint effect of 
rsl0490924 and smotdug accoiiated for 34.3% of the PAR of /\MD, 

EXAMPLE 12 

Family-Based Gene-EnviroBiae«t biteracttoB Analysis 

[69] The hi^y significan!: association of AMD with rsl0490924 that was observed in the 
mitial cjase-control acMysis was not repHcateci in die tkajly-based analysis with APL. 
This could be due to tie smaller size of ovx faraily-based dat^et, or to between-fatnlly 
heterogeneity. To test the latter possibility^, we applied OSA to our miatiplex family 
dataset, using the average pack-yeare of smoking in affected individuals as the OSA 
eovaiiate (ordered from Mgh to low). OSA indicated that the majorit/ of linkage 
evidence in the 10q26 region was coiitiifouted by otily 40 famibes with an average of ^ 
44 pack-years of smokiag (figure 4). The difference m nonparametric lod scores 
between the 90 mtdtiplex families with sufficient iiitbmiation to calcvtlate average 
smoking pack-yeans and the 40 jEamiUes with heavy smokets was significant 
(p=0.048), based on 10,000 xrms of the OSA permaiation test (Hauser et al, 2004), 
When &e APL analysis vras repeated using only mnlliplex and smgleton families 
which met the "heavy smoking" criterion in affected individuals (family-average of > 
44 pack-years of stJioking, 46 jEamaies total), &e results confirmed the case-coiitrol 
association, analysis: The APL p~value for rsl0490924 and 153750848 in LOC3S7715 
was 0.02. Three SNPs in other geaes also had p-vaiues of 0.02; rs76033g in PilSSl 1 
adjacent to LOC3S7715, rsl052715 in DMBTl (deleted in malignant brain tnmors 1 
[MIM 601969]) and hcv2917031 in CtPR26 (G pmteiti-ooupied receptor 26 [MM 
604847]). .Neither SMP had a case-control association p~vatae<0.05 in the overall 
aoalysis. 

BXAJ.4PLE 13 

Climcal Subgronp Analysis 

(70j It is of great oliiiicai interest to detennine whether the modification of the IX)C38771 5 
iis.sociation by cigarette smoking is observed in both geogmphio atrophy (OA, grade 4) 
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md Kfiovasciikr AMD (CNV, grade 5). Table 9 shows tiiat the sttong association witli 
LOC3S771 5 in smok^ was priinarily diie to genotvpe fequertoy differences between 
grade 1 controls (8.3% wift geoofrs-pe Tf) mid CNV patients (29,3% with genotype 
TT). Whes aU geaotyped mdividuals regardless of smoking liistory information w«a:e 
evaluated, the fnsqueacy of &e T allele was highja- in patients wife CNV (47.6%) 
compared to GA (39.0%). Our dataset had iimited statistical power for the AMD 
ssibtyps comparison since it inoladed a much smaller nnmber of GA patients, 
cotBpared bo CNV patieats (table 4), and since smoking history' infoimatioo was not 
STOilable for all study participants, 

EXAMPLE 14 (relates to examples 7-13) 
Study population 

{11} As part of an ongoing large-scale study of genetic and cnviromnetital risk factors for 
AMD, we have ascertained AMD patients, their affected and unaffected family 
tiieitibers, and a group of unrelated comiols of similar age and eJlaiic backgroatid at 
two sites in the Southeastern United States: Duke University Eye Center (DUEC) and 
Vanderbilt University Medical Center (VXJMC), Using stereoscopic color fimdus 
photographs, all enroUed individuals svere assig)i8d (by EAP and AA) one of five 
different grades of macular findings, as described previoiisly (Schmidt et ai, 2000; 
Seddon et al 1997) and summarised in Table 4. Our AMD ciassification is a 
modification of the AREDS grading system, using Wisconsin gradtag system example 
slid^ (Klein et al. 1991) and ihe Jtxteraational Classitication System (Bird et al. 1995) 
as guides. The more severely affected eye was used to classify indi\dduais. Unrelated 
. controls were enrolled via (i) study advertisement in DUEC- and VUMC-aMliaJed 
mwslettei^ (ii) racroitment presentsrtions by study coordinators at local retir^ent 
commumties, which were My to obtain heal& care at 0UBC or YUMC, 
respectively, and (iii) AMD-related seminars for the genearal public sponsored by 
DUBC or VUMC ophtfaalmoiogy cHmc«. Spoases of AMD patients were also asked to 
participate as controls. AH cases and controls inchidsd ia this stady were white md at 
least 55 years of age. The study protocol was approved by the tostittttional Review 
Boards (IRB) of the Dake Univeraity Medical Center and VUMC, the research 

27 



wo 2«W0<M56t 



adhered to ths tenets of tiie Declaration of Helsiald, aad inforaied consent was 
obtained from all study parttcipante, Biood samples wssre coiiected &xid genomic DNA 
was extracted firom whole biood ysing the PujeGeue system (Omtra Systems^ 
Minneapolis, MN) on an Autopuw LS, 

[72] Ittfoimatioii about ilie snioldtig history of study participaiits was obtained ftom a self- 
adniinistered qxiestioimaire that was foimatted to maximize readability for individuals 
with iow vision. However, if participants indicated that they could not complete the 
form, a project coordinator offered to assist tlie participants in fiJling out the 
qnestionnaite. Regular cigarette smoking wsis assessed fay two questions: 1) "Have 
you smoked at least 100 cigarettes in your lifetime?*' and 2) "Did you ever smoke 
cigarettes at least once per week?" Individmls answmctg '*yes" to bodi questions 
were asked the average number of cigtirettes tJiey smoked per day, the yesi" ihaJ tliey 
started smob'ng, whether tliey had quit smoking,, and if so, what year. This 
information was used to calculate pack-years of smoking as (cigarettes per day * years 
smoked) / 20 cigarettes per pack. The most general measmeiiient of smoking !iistor>'' 
was constructed as an "ever/never" variable based on a participant's response to 
question i) above. 

[731 The btudx popuSation for the .iiiaiy.sis presesnted here incluJcd SIT) imidafed AMD 
patients with early (grade 3) or ad\'C!r)c«d (gmdes 4 and 5) AMD. Of ihe.>ie, 200 bad at 
icasl one aompled fatTected or unaffected) relative and thus rontrihuted to ihc family- 
based association uaalysis. The remaining flO AMD patients withoiit .satnpled 
relatives, and 259 uarolafod controls without AMD (grades i and 2), made up an 
indepeadctit case-control dataser. Demographic and clhrical information for these 
mdividuals is shown in table 4. 

Genotyping, Linicage and Assoclatioii Analysis 

174] Previous work by our group (Ksaealy et al. 2004) and o^ms (Weeks et al. 2004; 
Majewski et al 2003; Seddon et al. 2003; Iyengar et al. 2004) suggested the presence 
of an AMD susceptibility iocus on chromosome 10q26, with the linkage peak centered 
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at appu^^n^^alo^y 122 Mb Tu mrmw down fee regiott most likely to harbor aa AMD 
^osc optibiSin, allele, we g«not>ped 103 SMPs in the 1 12 to 132 Mb interval, exteiidiiig 
10 Mb to either side of the reported linkage peak. We started with a density of 
approximately 1 SNP per 1 Mb and filled m ths 117-127 Mb m^on immediately 
smrouttding the 122 Mb peak with a higher dsjosify of one SNP per 140 kb oa 
average. All SNPs were selected using SNPSeiector software (X« et ai, 2005) to have 
appraximately equal spacing with minor allele frequeacy > 5%. Gewtyping was 
perfonned wi& fee TaqMaa allelic discrimination assay, mitig either Assays-Ois- 
Demand or Assays-By-Design products ifrom A]5pUed Biosystems. For qimliiy cantml 
(QC) procedmes, two CEPH standards were inckded on each 96~well plate, and 
samples from six iadivxdxials were dapKcated across ali pktes, with the laboratory' 
technicians blinded to their identities. Analysis required matching QC genoiypes 
within sM across phiss mid at least 95% genotyping efficiency. The Y402H vmimxi 
of ifce CFH gene was geaotyped by sequencing, jis previously described (Haines et al. 
2005). 

|75] Following the first ronnd of genotyping and statistical anaij'sis, we applied iterative 
association mapping (Oliveira et al. 2005) to select another set of SNPs m the peak 

region, defined approxniiately as the Mod-score-imii support iatei-val .surroxmding the 
peak muitipoint lod score. In addition to using SNPSekctor (Xu et al 2005), SN?s 
were identifitx! through resequencing of the LOC3877I5 gene and the CUZDl gene 
(CUB arid zona pelhicida-like domains 1 [HGNC; 17937]) in 48-72 unrekted affected 
and unaftbcted individuals. Our final SNP density was an average of one SNP per 43 
kb, fot a total of 117 SNPs in fee 122-127 Mb region, and an avemge of one SNP 
eveiy 220 M> outside of this mterval, for a total of 185 SJ^s in the 112-132 Mb 
region. 

[76] The genotype data were analr^ed xvith MERLIN (Abecasis et a!. 2002) to calculate 
nonpanimetric two-point and multipoint LDD* scores (Kong and Cox 1997), using 
the exponential mwlel Allele freqaencies were estimated from all genotyped 
iiidividnais. Parametric aifecteds-ocly heterogeneity lod scores (HLODs) assnmitig a 
domiiiaut (disease allele frequency 0-01) or recessive (disease allele firetjuency 0.2) 
motld were also computed with MERLIN, To avoid an iniJation of linkage evidence 
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due to mtet-marker linkage dkequiUbrimu (DO) (Boyles et al. 2005), we «3ed recentiy 
described methods based on estimated liaplotyps frequeades of SIsfP clusters m Mgh 
pairwise IJD, using a thresboM of r^«0<l6 to define fttese clusters (Abec^is mid 
Wiggjnton 2005). The LD patt«m in fee region of iiiteJ-est was analyzed with tbe 
Haploview progmm (Barrett st al. 2005), »si»g tbe generated genotj-pes fmm 
unrelated AMD patients as tiie input. Association analysis was applied to ail SlNTs in 
the 122-127 Mb region, using the family-based Association in (he Presence of Linkage 
(APL) test (Martin et al. 2003) and standard logistic regi^ssion analysis for case- 
control comparisons with adjustment for age and sex (SAS version 8.02, SAS Institute 
luc, Cmy, NC). An additive coding scheme was used, witii flie SNP model covariate 
taking on values -1, 0 mid 1 for genotypes 1/1, 1/2, and 2/2, and 2 being the minor 
allele in controls. As described above, we divided ot«- total sample into cases 
contribntmg to the APL iinalysis (affected individoals with at least one sampled 
relative, femilies), and an indepetuleui sample of cases witiiout sampled 

relatives (a-610) who were compared to 259 unrelated controls. We nsed the 
Genotype-IBP Sharing Test (GIST) method (Li et ai. 2004) to examine which of the 
most strongly associated SNPs best explained the linkage evidetice in the region. We 
also used the COCAPHASE module of the UNPHASED sofMe package 
(IJndbridge 2003) to perform coiiditioual haplotype ana^'sis. TMs analysis tested 
whether conditioning on the nsfc allele at a particular SNP accounted for the 
assodatiott signal in the regioti. If the association signal in tlve region was driven by a 
single SNP, conditioning on its eifeot was expected to remove all evidence of 
association for the retaining SNPs. 



Interactioa Analysis 

{771 We conducted additional analyses to incoiporate effects of the two most itnporiaat 
known AIvlD risk factors, smoking and ihs CFH gene, First, we fit a series of logistic 
regression modek to the combmed case-«ontrol data set (including probands from, 
family-based dataset) to identify iho model that best described (1) the joijit effects of 
CFH and LCX:3S7715, and (2) the joint effects of smoking and LOC3S7715. We 
followed a recently proposed modeling strategy (North et aJ, 2005) in wliieh the best- 
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fitting model was derived ou the bass's of Akaike's Monnation CMterion (AIC). The 
AIC compares diOerent models with a log-likeiibood mtio test that is penalized for the 
number of model parameters to idmtify the aiost parsimomous model that sdeqxiately 
fite the data. For each geaolype, two model terms were tested: one coding for additive 
effects at the first, second, or both loci (ADDl, ADD2, ADDBOTH), usi«g the coding 
described above, aad die other one coding for domiaiatice etTects (DOM I, D0M2, 
DOIvIBOTH), with a value of -0.5 for genotypes 1/1 and 2/2, and a Yaltic of 0.5 for 
geaotype 1/2. Three additional models {ADDm', ADDDOM, DOMiNT) were fit to 
test for deviation iTom joint additive or joint dominance effeois of CFH md 
LOC3877i5, and two additioBal models (ABD^SMOKEJMT. DOM„SMOKE mX) 
were fit for LOC3S7715 and smoldng (comparing ever- vs. never-sraokers). Models 
for which the AIC differed by less than 2 units were ccsnsidei-ed statistically 
mdistiiiguishable (North et al. 2Q05), and the model with fewer parametei^ was 
chosen &s the best fitting otie. For example, when the addition of the ADDINT term 
did not provide a subsfaiitiaily better model fit, tltis was iateipreted as lack of 
evidence for statistical intemction between the two factors. Thus, they each had 
independent main effects that w«re nadUplicative {additive on the logarithmic scak) 
sueh that &e best estimate of odds mtio for being exposed to both factors was the 
prodact of the two main effect odds ratios. 

Our second approach for incoiporating AMT:)-assoeiated covariates was motivated by 
earlier r^orts of the 10q26 iiulcage e\'idence being due primarjiy to femilies with 
heavy smokers (Vv''eeks et al 2004), Similar to the previous study, we used an ordered 
subset analysis (OSA) (Haiiser st al. 2004) with the family-avemge of smokmg pack- 
years as a covariate. To avoid an nndue infiaenoe of zero paok-yeats values on family 
averages, paclc-years were coded as missing for non-smokers. Using the M^-to-iow 
oxderbig of feinily-averaged pack-years, OSA tested whether a subset of ferailiea with 
heavy smokers provided significantly greater linkage evidence than the reference 
dat^et, wMch in this case was restricted to families for whom non-missing covariate 
v^nes could be oompnted, Tkis, the b^eline lod score was computed for teilies in 
which thssre was at least one affected smoker with pack-years information. 
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W.E CLAmi; 

1 . A method for assessing moreased risk of Age Related Macular DegeneratioiL 
comprising; 

detemiiomg identity of at least one micieotide residue of Complement Factor 

IT coding sequence of a p6i"Son; 

ideiitii>i5ig the miclsotide residue as normal or variant by compariiig it to a 
normal sequence of Compleraent Factor H coding sequeiice as slio wn. in SEQ 
ID NO; 1, wherein a person witji. a variant se^umce lias a iligher- risk of Age 
Related Macular DegeneraHort than a person with a normal sequence. 

2. A method for assessing incsreased risk of Age Related Macular Degeneration, 
coiBprisuig: 

detemiining ideaitity of at least one amino aoid residue of Con^>lement Factor 
Hpiotemofaperson; 

identifying the residoe as normal or variant by comparing it to a nonnai 
sequence of Complement Factor H as shovm in SEQ ED NO; 2, wherein a 
person with a variant sequence has a higher risk of Age Rekted Macular 
Degeneration thaii a person with a noiraal sequence. 

3. The me&od of claim 1 wherein tha at least one mtdeotide is located in an exon 
encoding a polyanion binding donutin. 

4. The method of claim 3 wherein the poiyanion binding domain is selected fitm the 
group consisting of SCR 7, 12-14, md 19-20. 

5. The metiiod of claim 3 wherein the polyanion binding domain is a hepaiin binding 
domain selected fxom the $foiip consisting of SCR 13, 19, and 20. 

6. The method of claim S wh^in the polyanion binding domam is in SCR 7. 
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7. The method of claim 1 whwem the at least one uxicleottde is located ia m exou 
sncoding C-reactive pmtem bmcting domam, 

8. The method of claim 6 wherein the C-reactive protein biiidiag domaitx is in SCR 7. 

9. The method of claim 1 -whsxem the at least one mcleotide is located in m exoa 
encoding a C3b bindittg domain. 

JO. The laethod of claim 8 wherein the C3b bindiag domam is in ati SCR selected fi-om 
the group consisting of 1-4, 12-14, and 19-20. 

1 1. The me&od of claim 1 wherein the mcleotide variant identified is at at 1277 of SEQ 

IX)NO:L 

12. The method of claim 2 wherein the amino acid yariatit identified is at residue 402 of 
SEQiT>NO:3, 

13. The method of claim I wherein the nucleotide variant identified is T1277C of SEQ ID 
NO: 1. 

14. The mechod of oiaim 2 wherein the ammo acid vaj fent identified is Y402H of SEQ ID 
NO: 3. 

15. The method of claim 2 wherem die at least one amino aoid residue is located a 
polyanion binditig domaia. 

16. The method of claim 14 wherein &e polyanion binding domdn is sselected from the 
groupconsL^tingofSCR?, 12-14, and 19-20. 

17. The metliod of claim 14 %4ierein the polyanion binding domain is hi SCR 7. 

IS. The meflfciod of claim 2 wherein the at test one aiaino acid residue is located m a C~ 
reactive proteia brnding domain. 

19. The method of claim 17 wherein the C-reactive protsm bindiag domain is in SCR 7. 

20. The method of claim 2 wheKjm the at least one amino acid residoe is located in a C3b 
binding domain. 

21. The method of claim 19 wherem the C3fe binding domain is in an SCR selected ISrom 
the group consisting of 1-4, 12-14, and 19-20. 

22^ A mefhod tor screenmg for apolentiai drug for treating Age Related Macnlaf 
Degeneratjon, comprising: 
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contacting a Compfcmeat Factor H protein mth a test agent ia the presence of 
a polyaision; 

meastimigpolyanionbrndiag to Complemeat Factor H; 

idesQti fying a test agent as a potential drug for treating Age Related Macular 
Degeaieratioo if it increases binding of Complement Factor H to the poiyacion. 

23. The me&od of claim 22 wherein iiie polyamoii is heparin. 

24. The method of claim 22 wherein the poiyanion is sialic acid, 

25 . A method for scrceniiig for a poteatial dmg for treating Age Related Macular 
Degeaeration, comprismg: 

contaetmg a ComplemenC Factor H protein with a test agent m. the presence of 
C-ReactiveProfBin; 

measuring C-Reaotive Protein bixiMs^ to Complement Factor H; 

identifyfeig a test agent as a potential drug ht treating Age Related Macular 
Degeneration if it incteasss binding of Complement Factor H to C-Reactive 
Protein, 

26, The method of claim 1 wherein the micleotide residue is detenniiied by hybridisation, 

27, The method of claim 1 whetTesn the nucleotide residue is dstennined by primer 
extension. 

28 . The me'&.od of claim 1 wherein the nucleotide residue i s determined by ntioleotide 
sequencing, 

29. The method of claim 1 wherein, the nucleotide i-esidue is detenniaed by ailele-specific 
ampiificatiott. 

30. The method of claim 2 wherein the amino acid residue is determined by means of an 
antibody. 

3 1. A method to assess risk of AiMD in a patietit comprising: 

d^ennming whsths- the patient has a T allele at rsl0490924; 
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deteisniamg whether the patiejit is a cigarette smoker; and 
ideatifyiiig &e patient as; 

being at Mgh risk of Mffl if £he patient has the T allele 

and is a cigarette smokesTs 

bemg at lower risk of AMD if ths patieiji has tlie T 
allele but is not a cigarette smoicer or is a cigarette 
smoker but does not have the T alldes, and 
being at lowest risk if the patient does riot liave the T 
allele and is not a cigarette snioker, 

32. A method ix> assess risk of and treat AMP in. a patient compri sing: 
detamiimng whether the patimt has a T aHele at i-sl0490924; 
detemiiamg whether the patient is a cigarette smoker; and 
providing the patient with a behavioral therajjy to encouimge smokiiig 

cessation if the patient has the T allele at rsl0490924 and is a cigarette smoksar. 



33. A mediod to assess mk of and treat AMD in a patient comprising: 

determuiing whether the patient has a T allele at rs 10490924; 
deteiminitig whether the patient is a cigarette smoker; and 
providijig the patient witii smokeless nicothie to encourage smoking cessation 
if the patiejit as the T allele and is a cigaieltc smoker. 

34. The me{h»>d of claim 32 wherein tlie stop of providing comprises prescribing the 
hehavioral therapy. 

35. The method of claim 32 wherein the behavioral dieiapy is coonseling. 
36; The method of claim 32 wherein the behavioral therapy is a class. 

37, The method of claim 32 whea^in the behavioral therapy is infonnation. 
3S. HhQ method of claim 32 wherein the informatioja is printed matter. 

39. The method of claim 32 wherein the informati.on is on a data storage medinm. 

40. The method of claim 32 wherein the infoimation is on an audio tape. 

41. The method of claim 32 wbeiein the information is on a video tape. 

42. The method of claim 33 wherem ftie smokeless nicotine is nicotitte gum. 
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43. The m&thod of claim 33 wbereio fee smokeless nicotine is in a tonsilemial patch. 

44. The jrjefhod of claim 33 wherein the smokeless nicotine is in a nasal spray. 

45. The meOiod of cldm 33 whereia the smokeless nieotiGe is in an iahder. 

46. Hie method of claim 33 wherein tiie step of providing comprises prescribing or 
reoomraending a form of smokeless mcotiae. 

47. The method of ciainj 31 forther comprisiog deteimining if the paueat lias a variant of 
Compiemeat Factor H protein or coding seqaenee, 

48. The method of ciaiTn 47 wherein a variant protein is detetmined. 

49. The method of daun 47 wherein a variant coding sequence is determmed. 
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ttaatttgtg 



gctggata 
gaggtctsta 
actcctatta 
attgcaoaca 
atttggaaaa 
acgttgcctig 
agaat:9gctg 
tagatatcga 
cgaaa'catca 
gatgtgggaa 
tatttatgaa 
acfcatgastg 
gtggttacaa 
aaatagatgt 
tgaaattcto 
actttggatt 
ctcctgaaafc 
aagtggtgga 
gtgttgatgg 
atatacctga 
attcagtgga 
gtattcatgg 
gcaaatcatc 
acsattctaa 
taaatggaag . 
ctccacctca 
aaaaagtatc 
gaaaagatgg 
cacctcagat 
atgggactaa 
caacatgcta 
ctccaoctga 
aagaagttac 
gcttaggaga 
ctagctttga 
aagtgactfca 
ttaafcagcag 
cagtacaaaa 
tacgttatca 
atggaaactg 
bacctattga 
cagttgagta 
gaaatggaca 
ttatggaaaa 
caggtgaatc 
cattgcgaac 
atcaatcata 
attttttatg 
aaaatgtaat 



1G80 

1140 

1200 

1266 

1320 

138 Q 

1440 

1500 

ISSO 

1620 

1630 

1740 

IBOO 

XS60 

1920 

19S0 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2S80 

2640 

2700 

276C 

2820 

2860 

3940 

3000 

30S0 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3S0O 

3926 



<21Q> 2 
<311> 3926 

<2ia> mh 

<213> EoTOO sapiens 



wo 2(Mt6/0'>6561 



mh./ U S CI B ./ O 7 7 E S 




CDS 

(74) . (37S7) 



<22(}> 

<221> variation 
<222> {L2-rn . . . (1277) 



<22i> poly?iso-.t:phic variant 
<:400> 2 

aattcttgga agaggagaac tggacgttgt gaiacagagtt agctggtaaa tgtcctGt.tii 
aaagstccaa aaaatgagac ttctagcasa gattafcttgc cttatgttat gggctatttg 
tgtagcagaa gattgcaat.g aaci:i;cct:cc aagaagaaat acagaaattc tgacaggttc 
ctggtctgac caaacatatc cagaaggaac ccaggctatc tataaatgcc gccctggana 
fcagatctctt. ggaaatgtaA taatggt.atg caggaaggga gaatgggttg >-::tnttaatcc 
attaagga?,* vg:;caga«.ia gycctit.gtgg acatcccg-ga gatactcctt ttggtac'cct 
tacccttaca ggaggaaaCg Lgtttgaata tggtgtaaaa gctgtgtata catgtaatg.a 
ggggtatcsia titgctaggtg agattaaf:i:.a cootgaiitgt gacacagatg gatggaccaa 
tga-actcc-i: atatglg^.ag tt.gtgaagtg 'czza.zos.gzg acagaaccag agaacggaaa 
aat "-gccagc agtgcciCHtigg uaccagatcg gg?.ata."ca;: cttggacaag cagtacggtt 
tgtatgtaac^ i:.caagc-:aca agattgaagg agatgaagaa atgcattgtt cagacgatgg 
tttttggagt: iiaagagaaac caaagtgt-gt ggaaatttea tgcaaatccc cagatgttat 
aaatggatct; cctataccnc agaagaLLat ttataaggag aatgaacgat ttcaafcataa 
atgtaacatg ggttatgaat acagtgaaag aggagatgct gtafcgcactg aatctggatg 
gcgtccgttg cchfccatgtg aagaaaaatc atgtgataat ccttatattc caaabggtga 
ctactcacct ttaaggatta aacacagaac tggagatgaa atcacgtacc agfcgfcagaaa 
tggt-ttttat cctgcaaccc ggggaaatac agccaaatgc acaagfcactg gqfcggatacc 
tgctccgaga tgtaccbtga aaccttgtga ttafcccagac attaaacatg gaggtctata 
tcatgagaat atgcgtagac catactttcc agtagctgta ggaaaatatt actcctatta 
cfcgfc-gatgaa cattttgaga ctccgtcagg aagttacfcgg gatcacattc attgcacaca 
agatggatgg tcgccagcag taccatgcct cagaaaatgt tattttcctt atttggaaaa 
tggatataat caaaatcatg gaagaaagtt fcgtacagggt aaatctatag acgttgcctg 
ccatcc^ggc tacgctcttc caaaagcgca gaccacagtt acatgtatgg agaatggctg 
gtctccbact; cccagatgca tccgtgtcaa aacatgttcc aaataaagfca tagatattga 
gaatgggttc atttcfcgaat ctcagtatac atatgactta aaagaaaaag cgaaatataa 
atgcaaactia ggatatgtaa cagcagatgg tgaaacatca ggatcaatta gatgtgggaa 
agatggatgg tcagctcaac ccacgtgcat taaatcttgt gatatcccag tatttatgaa 
tgcjcagaact aaaaatgact tcacatggtt taagofcgaat gacaoattgg actatgaatg 
ocatgatggt tatgaaagca atactggaag caacactggt tccatagtgt gtggttacaa 
tggttggtct gatttaccca tatgttatga aagagaatgc gaacttccta aaatagatgt 
acacjttagtt octgatcgca agaaagacca gtataaagtt ggagaggtgt tgaaattctc 
ctg«aaacca ggatttacaa tagttggaco taattccgtt pagtgctacc actttggatt 
gtctccfcgac ctcccaatafc gtaaagagca agtacaatea tgtggtccac ctcctgaact 
cctcaatggg aatgttaagg aaaaaacgaa agaagaatat ggacacagtg aagtggtgga 
atattattgc aatcctagat ttctaatgaa gggacctaat aaaattcaat gtgfctgatgg 
agsgtggaaa actttaccag tgtgtattgt ggaggagaqt acctgtggag atabacctga 
Rctt.gaacat ggctgggccc agctttcttc ccctccttat tactatggag attcag-gga 
attraat::yc t cvigaat caL ttacaatgat tggacacaga t*3aafctacgt gtattcatgg 
agi-afcaq.ico caaoctcccc agtgtgtggc aatagataaa cctaagaagt 
aaal:irt:a^!-i- atdcctgagg aacatttaaa aaacaagaag gaar.taga'cc ataattc:t:aa 
cafcctcijyt a--j agatytagag gaaaagaagg atggatacac acagtctgca -^aaacggaag 
a^ggy-stvCrf yaag'^gaact gctcaatggc acaaatacaa tfcatgcccac ctccacctca 
gatccccaat tctcacaata fcgacaaccac actgaattat cgggatggag aaaaagtatc 
tgttccttgc caagaaaatt atctaatfcca ggaaggagaa gaaatbacat gcaaagafcgg 



60 
120 
1.3C 
240 
300 
350 
■120 
480 
&40 
SOO 
660 
720 
780 
840 
900 
9S0 
1020 
1080 
3.140 
1200 
1260 
1320 
1380 
1440 
ISOO 

iseo 

1620 
1680 
1740 
1800 
ISfiC 
192C 
1380 
2040 
2100 
.^160 
^220 
22^0 

:;j40 

2400 
2460 
Ab^Q 
2580 
2640 



-It. crjAcrLtat acLtgtgagg 

aasatqq c oa"CGtccat 
aaatcfc-catci cccatuwi-a^ 
Oti'-v-tgtoca AcatattaJ'a 
atggacagga aggccaaaat 
tgctfcatata gtgtcgagac 
atgtaggagc ccttatgaaa 
gacggaacca cctcaatgca 
caatggggac attacttcat 
Gcaatgccag aaGtegt^tr- 
afeggtca^aa ccaccaaaa 
ttstaacata qra^Laagy- 
agtt-qaattt 3Lg,fj_aaac 
a=i("=.i qi~t^^ yatgggaaa-" 
a-g-cgci^^c cttrrt-tv,-iM 
t-^ttjtttta oLccttrtt- 
-ataagrsga Qaocggtggc 



tctgtgttga 

gtqgt t tv-c-y 
fdc c tct^gtg 
;;tcafatgt c 
i,"giaa! ca 

aq t .g-ta^ga 
doatgg^i^gg 
gcagagacac 
agafcgagtaa 
tgfcttgggga 
aagattctac 
fere qt t gt t 
,xM t i gaqy.j 
gvltaf 
yg^c?>grrji^ 
gggg.lL t<.y 
jgag t at c c 
aa t ' t agra 
tt.a. ijgLa 
tcfcctt 



^yaiaggcctt 
agacagttat 
tgggcGCgca 
ciacaja-fcgt 
9 g^auaag 
agccagtaat 
c tcctgtgtg 
atatccatct 
tgaagaagfcg 
aggaaaatgt 
.gtat^itgct 
tctc.c?agcga 
gfigtgtaata 
aRagaaqctt 
tccctcatca 
aacttgtgca 
ttaaatcagt 
aaattttgga 



cctfcgtaast 
cagtatggag 
attgcaaaab 
ctcagt-ctac 
gcgggtgagc 
gtaacatgca 
aatccgccca 



atgtgtttaa 

gggccccctc 
i-cagcttcst 
ataaoatgta 
tcccgagaaa 
tattcgagaa 
cgttctcaca 
aaaagafeaga 
tctcaatttc 
ttaatttgtg 



cacctcagPfT'l-' 
atgggactaa 
caacatgcta 
ctccacctga 
aagaagttac 
gcttaggaga 
ctagctttga 
aagtgacfcta 
ttaatagaag 
cagtacaaaa 
tacgttatca 
atggaaactg 
cacctattga 
cagttgagta 
gaaatggaca 
bfcatggaaaa 
caggtgaatc 
cattgcgaac 
atcaatcata 
afctttttatg 
aaaatgtaat 



2760 
2 82 0 
2830 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
33S0 
3420 



3480 
3 540 
3SO0 
3660 
3720 
3780 
384C 
3900 
3926 



<210> 3 

<2lx> 1231 

<2X2> PRT 

<213> Hoao sapiens 

<220> 

<221> SIGNAI. 
<222> {!) . . . SIS) 



<220:> 

<221> VARIANT 
<.23a> (402) . , . (402) 
«;223> polymorphic residue 



<400> 3 

Met Arg T.,eti Leu Ala Lys lie lie Cya Leu Met Leu, Trp Ma lis Cys 

1 5 10 ' ■ 15- 

val Ala Sl« Asp Cys Asn Glu Leu Pro Pro Arg Arg Tto Glu lie 

20 2S '30 

Leu Thr Gly 3mx Trp Ser Asp Gin Thr Tyr Pro Glu Gly Thr Gin Ala 

35 40 45 

lie Tyr Lys Cys Asrg fro Gly Tyr Arg Ser L&a Gly Asn Val He Met 

50 55 SO 

Val Cys Axg Lys Gly Slu Trp Val Ala Leu Asa Pro Leu Arg Lys Cys 
S5 ~ 70 ' 7S 80 

Gin Lyg Arg Pro Cys Gly His Pro Gly Asp Tto Pro Pfae Gly Thr Phe 

SB 90 SB 

Thr Leu t'hr Gly Gly Asn Val She Glu Tyr Gly Val Lys Ala Val tCyr 

100 105 110 

Thr cys Asn Glu Gly Tyr Gla Leu Leu Gly Glu He Aan Tyr Arg Glu 

115 120 125 

Cyg Asp Thr Asp Gly Trp Thr Asn Asp He Pro He Cys Glu Val Val 



WO2«W0<M56t 



13J3 — 

*S/T-6vPlfe9Sr^' W'liPaia Pro Glu Asa C-ly Lye lie Val Ser Ser 
145 ' ISO 155 160 

Ala Met Glu Pro Asp Axg Glu Tyr His gha Sly Gin Ala Val Arg Phe 

1S5 170 175 

Val Cys Asn Ser Gly Tyr iys lie Glu Gly Asp Glu Glxj Met His Cys 

180 185 . 190 

Ser Asp Aap Gly Phe Trp Ser Lys Glu I*ys Pro Lys Cys Val Glu He 

195 200 20S 

Ser Cys Lys Ser Pro J^p Val lie Asn Gly Ser Pro lie Ser Gin Lys 

2X0 215 ^'^^ 

Il« lie Tyr Lys Glu Asn Qln Arg )Pha Gin Tyr l.ys Cys Asn Met: Gly 
225 230 235 240 

Tyr Glu "Tyr Ser Glu Arg Oiy Asp Ala Val Cys Tbr Glu Ser Gly Trp 

245 2S0 255 

Arg Pro Leu Pro Ser Cys Glu Glu Lys Ser Cys Asp Asn Pro Tyr He 

260 265 270 

Pro Asn Gly Asp Tyr Ser Pro Leu Arg Xle Iiys His Arg Thr Gly Asp 

275 280 285 

Glu lie Thr TYX Gin Cys Arg Asn Gly Phe Tyr Pro Ala Thr Arg Gly 

290 295 300 

Asn Thr Ala Lys Cys Thr Ser Thr CJly Trp He Pro Ala Pro Arg Cys 
305 310 315 320 

Thr I..8U Lys Pro Cys Asp Tyr Pro Asp lie I^ys Kts Gly Gly Leu Tyr 

325 330 335 

His Glu asn Met Arg Arg Pro Tyr Phe Pro Val Ala Val Gly Lys Tyr 

340 345 350 

Tyr Ser Hyr Tyr Cys Asp Glu His Phe Glu Thr Pro S«r Gly Ser Tyr 

355 3S0, 365 

Trp Asp His II© His Cys Thr Gin Asp Gly Trp Ser Pro Ala Val Pro 

370 375 380 

Cys Leu Arg Lys Cys Tyr Phe Pro Tyr Leu Glu Asn Gly Tyr Asn Gin 
335 390 39S 400 

Asn Tyr Gly Arg I^ys Phs Val Sin Gly Lys Ser lie Asp Val Ala Cys 

405 410 415 

His Pro Gly T>'r Ala Leu Pro Lys Ala Gin Thr Thr VaX Thr Cys Met 

420 425 430 

Glu Asn Gly Trp Ser Pro Thr Pro Arg Cys Xle Arg Val Lys Thr Cys 

435 440 44S 

Sar Lys Ser Ser He Asp He Glu Asn Gly Phe He Ssr Glu Ser Gin 

450 ^ 455 460 

Tyr Thr Tyr Ala Leu Lys Glu Lys Ala Lys Tyr Gin Cys Lys Lett Gly 
465 470 475 480 

Tyr val Thr Ala Asp Gly Glu Thr Ser Gly Ser lie Arg Cys Gly Lys 

485 490 49S 

Aep Gly Tzp Ssr Ala Gin Pro Thr Cys He Lys Ser Cys Asp lie Pro 

500 505 SIO 

Val Phe Mat Asn Ala Arg Thr hys Asn Asp Phe Thr Trp Phe Lys Leu 

515 S20 ' ' " 525 

Assn Asp Thr Leu Asp Tyr Glu Cys His Asp Gly Tyr Glu Ser Asn Thr 

530 535 540 ^ 

Gly Ser Thr Thr Gly Ser He Val Cys Gly Tyr Asn Gly Trp Ser Asp 
545 S50 555 S60 

Leu Pro He Cys Tyr Glu Arg Glu Cys Glu Leu Pro Lys He Asp Val 
565 570 575 



»et»Ja©»f a!** S ' sSc " 

l,en hvB Phs Ser Cvb Lys Pro Gly ?hei Thr lie Val Gly Pro Ami Ser 

595 600 505 

Val Gin nvf! Tyr His She Gly Leu Ser Pro Asp L«u ero lie Cys hys 

610 ^ ■ 61 S 520 

Gj-u a in Vftl Gin K^r Cys Glv Pro Pro t>ro Glu tau Leu Asn Gly Asn 
635 630 ^ €35 640 

val hys Gl\- Lvs Thr Lys Gl\t Glu Tyr Gly Kis Ser Cil« Val Val Glu 

645 650 655 

Tyx Tyr Cys Asn Pro Arg Phe Leu Met Bys Gly Pro Asn X.ys He Gin 

6S0 S6S 670 

cys Val Asp Gly Glu Trp Thr Thr Leu Pro Val Cys lis Val Glu Glu 

675 600 €85 

Ser Thr Cys Gly Asp Xle ?ro Glu Leu Glu His Gly Trp Ala Gin Leu 

690 65S '-'00 

Ser Ser Pro Pro Tyr Tyr Tyr Gly Asp Ser Val Glu Phs Asn Cys Ser 
705 710 715 720 

Glu Ser Plie Thr Met He Gly His Arg 'Ser He Thr Cys He Kis Gly 

725 730 73S 

Va.1 Trp Thr Gin Leu Pro Gin Cys Val Ala He Asp Lys I.eu Lys Lya 

740 7-15 750 

Cvs Lys Ser Ser Asn Leu He He Leu Glu Glu His Leu Lys Asn Lys 

7SS 7S0 -765 ' ~ 

Lys Slu Phe Asp His Asn Ser Asn He Arg Tyr Arg Cys Arg Giy Lys 

770 775 '780 

Glu Gly Trp He His Thr Val Cys lis Asn Gly Arg Trp Asp Pro Glu 
785 790 7S5 800 

Val Asn Cys Ser Net Ala Gin He Gin Leu Cys Pro Pro Pro Pro Gin 

805 810 S15 

He Pro Asn Ser His Asn Met Thr Thr Thr Leu Asrs Tyr Arg Asp Gly 

820 825 830 

Glu Lys Val Ser Val Leu Cys Gin Glu Asn Tyr Leu He Gin Glxi Gly 

S35 340 S45 

Qlu Glu He Thr Cys Lys Asp Gly Arg Trp Gin Ser He Pro Leu Cya 

850 855 860 ' 

Val Glu L.ys He Pro Cys Ser Gin Pro Pro Gin He Glu His Gly Thr 
865 870 875 880 

He Asn Ser Ser Arg Ser Ser Gin Qlu Ssr Tyr Ala His Gly Thr Lys 

885 S90 835 

Leu Ser Tyx Thr Cys Glu Gly Gly Phe Arg He Ser Glu Glu Asn Glu 

900 90S 910 

Thr Thr Cys Tyr Met Gly Lya Trp Ser Ser Pro Pro Gin Cys Glu GXy 

SflS 320 925 

Leu Pro Cys Lys Ser Pro Pro Glu He Ser His Gly Val Val Ala His 

93C5 935 ' ■ 940 

Met Ssr Asp Ser Tyx Gin Tyr Gly Glu Glu Val Thr Tyr Lys Cys Phe 
945 950 35S 560 

Glu Gly Pbe Gly He Asp Gly Pro Ala He Ala Lys Cya Leu Gly Glu 

S6S 970 975 

Lys Trp Ser His Pro Pro Ser Cys He Lys Thr Asp Cys Leu Ser Leu 

980 985 S30 

Pro Ser Pbe Slu Aen Ala He Pro Met Gly Glu hys Lys Asp Val Tyr 

$SS 1000 1005 

Lys Ala Gly Glu Gin Val Thr Tyr Thr Cys Ala Thr Tyr Tyr Lys Met 



"''iSJ'MltePH/W^II^ Cys lie Asa Ser tog Trp Thr Gly Arg 
1025 1030 1035 1040 

Pro Thr Cvs Arg Asp Thr Ser C>'S Val Asn Pi-o Pro Thr Val Gin Asn 

1045 1050 loss 

Ala Tyr Xle Yal Ser Arg Gin Met Ssr I^ys Tyr Pro Ser Gly Glu Ai-g 

1060 1065 1070 

Val Arg Tyr Gin Cys Arg Ser Pro Tyr Glu Met Ph« Gly Asp Glu Glu 

1075 X080 1085 

Val Met Cys Leu Asn Gly Asa Trp Thr GIu Pro PEO Gin Cys Lys Asp 

1090 1095 1100 

Ser Thr Gly Xjys Cys Glv Pro Pro Pro Px-o lie Asp Asa Gly Asp lie 
11 OS 1110 1115 1120 

-mr ser ?he Pro Leu Ser Val T^'r Ala Pro Ala Ssr Ser Val 61a Tyr 

112S 1130 li35 

Gin Cys Gin Asn I.eu Tyr Gin Leu Glu Gly Asn Lys Arg lie Thr Cys 

1140 1145 ilSO 

Axg Ask Gly Gin Trp Ser Glu Pro Sro Lys Cys Leu His STO Cys Val 

11 55 iiso lies 

lis ser Arc? Glu He Met Gla Asn T^-r Asn lie Ala Leu Arg Trp Thr 

1170 ~ 1175 1130 

Ala Lys Gin Lys Lea Tyr Sor Arg Tto Gly Glu Ser Val Giu Phe Val 
1185 IISO li^S 1200 

Cys Lys Arg Gly Tyr Arg Leu Ser Ser Arg Ser His Thr Lea Arg Thr 

1205 1210 1215 

Thr Cys Trp Asp Gly Lys Leu Glu Tyr Pro Thr Cys Ala Lys Arg 
1220 122S ' 1230 



<210> 4 

<211> 1231 

<212> PRT 

<213> Hotao sapiens 



<220> 

<221> SIGiiM, 
<232> {!)... (18) 



<220> 

<22i> vmiimr 

<222> (403) , , . (402) 
<;223> pol^TTiOrphic residue 



<400> 4 

Met Arq Leu Leu Ala Lys lie lie Cys Leu Met Leu Tirp Ala He Cys 

1 " 5 10 15 

Val Ala Glu Asp Cys Asn Glu Leu Pro Pro Arg Arg Asn Thr Gla lie 

20 25 30 

Lea Thr Gly Ser Trp Ser Asp Gin Thr Tyr Pro Glu Gly Thr Gin Ala 

35 40 45 ' 

He Tyr Lys Cys Arg Pro Gly Tyr Arg Ser Leu Gly Asn Val He Met, 

SO ■ 55 60 

Yal cys Arg Lys Gly Glu Trp Val Ala Leu Asn Pro Leu Arg Lys Cys 
65 70 80 

Gin Lys Arg Pro Cys Gly His Pro Gly Asp tis: Pro Phe Gly Thr Phe 



■^UtJtl^J^-h^^h.^ I-J/Asri'^di Pna aj.u Tyr Gly Val Lys Ala Val T^x: 
100 105 110 

T,i_ -Y ''■•nr Jlu 01 . Tvi v--n L i.e i Gly Olu lie Asn Tyr Arg Glu 

120 125 
^v- lii^p rhr A&p Glv Trp Tnr Asn Asp Tie Pro lie Cys Gla Val Val 

130 135 140 

L^t> r ieu Pro Val Thr Ala Pro 31u Asn Gly Lys lis Val Ser Ser 
lis ISO 155 160 

Mtit Glu Pry 2^3?; "ra Glu l-yr His Phe Gly Gin Ala Val Arg Phe 
16=^ 170 175 

Val Cys ASB Ser Gly Tyr hys. Xle Glu Gly Asp Glu Glu Met His Cys 

lao 186 190 

Ser Aap Asp Gly Phe Trp Ser hyB Gin Lys Pro l.ys Cys Val Glu He 

135 200 205 

Ser Cys Lys Ser Pro Asp Val He Asn Gly Ser Pro lie Ser Gin hy& 

310 215 220 

■?-le lis Tvr Lys Glia Asn Glu Arg Phe Gin Tyr Lys Cy-s Asn Uet Gly 
225 * 230 23S 240 

Tyr Glu VyT Ser Glu Arg Gly Asp Ala Val Cyi3 Thr Glu Ser Gly Trp 

245 250 2S5 

Arg Pro Leu Pro Ser C^'s Glu Glu X.ys Ser Cys Asp Asn Pro Tyr lie 

260 265 370 

^ro Asn Gly Asp 'iyr Ser Pro Leu Arcy Xle Lys His Arg OShr Gly Asp 

275 280 285 

Glu lis Thr Tyr Qln Cys Arg Asn (Sly Pha Tyr Pro Ala Thr Arg Gly 

290 295 300 

Asa Thr Ala i.ys Cys Thr Sar Thr Gly Trp lie Pro Ala Pro Arg Cys 
305 310 313 • 320 

Thr Lau Lys Pro Cys Asp Tyr Pro Asp lie Lys His Gly Gly Leu T^^r 

325 330 335 

Hie Glu Asn Met Arg Arg Pro Tyr Phe Pro Val Ala Val Gly Lys Tyr 

340 345 350 

Tyr Ser Tyr Tyr Cys Asp Glu His Phe Glu Thr Pro Ser Gly Ser Tyr 

3SS 3 SO 355 

Trp Asp His He Hie Cys Thr Gin Asp Gly Trp Ser Pro Ala Val Pro 
370 375 380 

Cys Leu Arg Lys Cys Tyr Phe Pro Tyr Leu Glu Asn Gly Tyr Asa Gin 
335 390 3&S 400 

Asn His Gly Arg Lys Phe Val Gin Gly Lys Ser Xle Asp Val Ala Cys 
405 410 415 

His Pro Gly 'TyT Ala Leu Pro Lys Ala Gin Thr Thr Val Thr Cys Met 

420 425 430 

Glu Asn Gly Trp Ser Pro Thr Pro Arg Cys lie Arg Val Lys Thr Cys 

435 " 440 445 

Ser Lys Ser Ser Xle Asp He Glu Asn Gly Phe He Ser Glu Ser Gin 

450 455 460 

Tyr I-far Tyr Ala Leu Lys Glu Lye Ala Lys Tyx Gin Ci'-s Lys Leu Gly 
465 470 475 480 

Tyr Val Thr Als Asp Gly Glu Thr Ser Gly Ser lie Arg Cys Gly Lys 

485 490 ■ 495 

Asp Gly Trp Ser Ala Gin Pro Thr Cys He Lys Ser Cys Asp He Pro 

500 505 510 

Val Phe Met Asn Ala Arg Thr Lys Asn Aisp Phs Thr Trp Phe Lys Leu 
515 520 525 



^.,,3 vj^g,..WO2(Mt6/0'>65Al r^,^ p^^, ;:,sp .-;;.v i-. L' alu Ser as:i ThrPCT/US2«(k)/fi07725 

•■^t::t.5^psoii/& " " "340 

Qly Ser Vav Thr Gly Ser Ii<rf Val Cys Gly Tyx Asu Giy urp Ser Aep 
545 SSO S55 560 

Leu Pro lie Cys Tyr Qln Arg Glu Cys Qlu h&u Pro Lys lie Asp Val 

565 570 S7S 

His Leu Val Pro Asp Arg Lys Lys Asp Gin Tyr I*ys Val Gly Glu Val 

580 58S 590 

Leu Lvs Phe Ser Cya t.ys Pro Gly Phe Thr lie Val Gly Pro Asa Ser 

595 600 60B 

Val Gin Cys 'ryr His Phe Gly Leu Ser Pro Asp Leu Pro He Cys Lys 

SIO 615 620 

Qlu Gin Val Gin Ser Cys Gly Pro Pro Pro Glu Leu Leu Asn Qly Asn 
62S 630 

val Lys Gl« Lys Thr Lys Glu Glu Tyr Gly His Ser Qlu Val VaX Glu 

545 630 SSS 

Tyr Tyr Cys Asn Fro Arg ?he Leu Met Lys Gly Pro Asn Lys He Gin 

660 SS5 670 

Cys val Asp Gly Glu Trp Thr Thr Lsu Pro Val Cys lie Val Glu Glu 

S75 680 €8S 

Sar Thr Cys Gly Asp He Pro Glu Lsu Glu His Gly Trp Ala Gin Leu 

690 595 700 

Ser ser Pro Pro Tyr Tyx Tyr Gly Asp Ser Val Glu Pbe Asn Cys Ser 
705 710 715 720 

Glu Ser PhB Thr Met He Gly His Arg Ser He Thr Cys He His Gly 

725 730 '^35 

Val Trp Thr Gin Leu Pro Gin Cys Val Ala He Asp Lys Leu Lys Lys 

740 745 7S0 

Cys Lys Ser Ser Asn Leu He He Leu Glu Giu His Leu Lys Asn Lys 

7SS 760 765 

LvH GIis Phe Asp His Aan Ser Asn He /ura Ty^-' Arg Cys Arg Gly Lys 



770 



780 



Glu Gly tt-o lie His Thr Val Cys He Asn Gly Arg Trp Asp Pro Glu 
785 790 795 800 

Val Asa cya Ser Met Ala Gin He Gin Leu Cys Pro Pro Pro Pro Gin 

805 8X0 815 

He Pro Asn Ser His Asn Met Thr Thr Thr Leu Asn Tyr Arg Asp Gly 

820 S2S S30 

Glu Lys val Ser Val Leu Cys Gin Glu Asn Tyr Leu He Gin Glu Gly 

835 840 845 

Sltt Glu He Thr Cys Lys Asp Gly Arg Trp Gin Ser Ha Pro Leu Cys 

850 S5S 860 

Val Glu Lys Xle Pro Cys Ser Gl» Pro Pro Gin He Glu His Gly Ttir 
S65 870 87S ' 880 

He Asn Sar Ser Arg Ser Ser Gin Glu Ser Tyr Ala His Gly Thr Lys 

885 890 895 

Leu Ser Tyr Thr Cys Glu Gly Gly Phe Arg He Ser Glu Glu Asn Glu 

SOO SOS 910 

Thr Tbr Cys Tyr Met Qly hys Trp Ser Ser Pro Pro Gin Cys Glu Gly 

915 920 ■ 925 

Leu Pro Cys Lvs Ser Pro Pro Glu He Ser His Gly Val Val Ala His 

930 * 335 940 , 

Met ser Asp Ser Tyr Gin Tvr Gly Giu Glu Val Thr Tyr Lys Cys Phe 
945 950 955 ' - ' 9S0 

Giu Gly Phe Gly tie Asp Gly Pro Ala He Ala Lys Cys Leu Gly Glu 



wo 2(Mt6/0'>65<Vl 375 PClWS2«Wfi07725 

■^l^^te^i^l^^'^^^^^ Cys lie liys Thr isp Gys Leu Ser Leu 

Pro t3er Phe GJ.u Asn Ala lie V-To Met Gly Glu Isys I^ys Asp Val t'yr 

995 1000 1005 

Lyi5 AlK Giy Giu aia val Thr Tyr rhr Cysj Ala Thr Tyr Tyr Ijys Met 

lOiO lOlS 1020 

Ajsk C-ly Aia Ssr Aen Val Thr Cys lie Asn Ser Arg Trp Thr Gly Arg 
102S ' 1030 1035 1040 

Pro Thr Cys Arg Asp Ito Ser Cys Val Asn Pro Pro T^hx: Val Gin Asn 

104S 1050 3.0SS 

Ala Tyr lie Val Ser Arg Gin Met Ser Lys Tyr Pro Ser Sly Giu Arg 

1060 1065 1070 

Val Arg Tyr GXn Cys Arg Ser Pro Tyr Giu Met Phe Gly Asp Glxi Giu 

107S 1080 1085 

Val Met Cys Beu Asn Gly Asn Trp Thr Giu Pro Pro Gin Cys Lys Asp 

1090 109S 1100 

Ser Thr Glv l.ys Cys Gly Pro Pro Pro Pro lie Asp Asn Gly Asp lie 
1105 * " lilO 1115 1120 

Thr Ser Phe Pro hsn Ser Vail Tyr Ala Pro Ala Ser Ssr Val Giu Tyr 

1125 1130 1135 

Gin CVS Gin ksn l<eu Tyr Glii l>«u Giu Gly Asn X^ys Arg lie Thr Cys 

1140 1145 1150 

Arg Asn Gly Gin Trp Ser Giu Pro Pro Lys Cys Leu Hia Pro Cys Val 
115S 1160 lies 

lie ser Arg Giu Xle Met Giu Asn Tyr Asn lie Ala Leu Arg Trp Thr 

1170 1175 1180 

Ala Lys Gin hys Leu Tyr Ser Arg Thr Gly Giu Ser Val Giu Phe Val 
1185 IISO 1195 1200 

Cya Lys Arg Gly Tyr Arg Leu Ser Ser Arg Ser His Thr Leu Arg Thr 

1205 1210 1215 

Thr Cys Trp Asp Gly hya Leu Giu Tyr Pro Thr Cya Ala Lys Arg 
1220 1225 1230 



<210> S 
<211> 23 
<212> DNA 

<213> Homo sapiens 
<400> 5 

ggtttcttct. fcgaaaatcaa agg 

<:210> 6 

<21i> 22 

<212> »m 

<213* Homo sai)ienB 

ccatfcggtaa aacaaggtga oa 

<210> 7 
<2il> 107 
<212>. PET 

<213> Homo sapiens 



10 



iC-^^^'o'"^' ^M' '"'^'■^ ^^''^^ 


Isi £1 


•••^^ 5:31 Si:::; 




f'Sefc Leu Ai,y 




Pro Giy Pro M&t Val I&i^ Glu Ala Glu Gly 


Lye 






10 15 




,f .1 -^.^ 
Giy Griy r'TO 


-1 MPt- 


Ali-i i9ar Leu Ser Ser Ser Val Val Pro Val 


Ser 




25 30 




Kis XI© Sex" 




Arg Glu Ser Val Leu Asp Pro Gly Val Gly 


Gly 






40 4S 




Glu Gly Ala 


Ser hap 


l>ys Gin Arg Ser Lya Leu Ser i.su Ser His 


Ser 


SO 




55 60 




Met lie Pro 


Ala Ala Lys lie His Hir Glu Leu Cys Leu Pro Ala 




65 




70 75 


80 


Phe Ssr Pro 


Ala Gly 


Thr Gin Arg Arg Pbe Glu Gin Pro Gin His 


Kis 



85* 90 95 



I,eu Thr Leu Ser He lie His Thr Ala Ala Arg 
3.00 105 



<210> a 
<211> 808 
<212> BHA 

<213» Hoiiio sapiens 



<4:00.v S 

gaaatggcag ctggcttggc aaggggacag 
Gctacatgct gcgcctataa ccaggaccga 
cbgagatggc aagtctgtcc tccteggfcgg 
agtctgtgct ggaccctgga gttggtggag 
tgtctfc.tatc acactccatg atcccagctg 
ccttcttctc tcctgctgga acccagagga 
tgtctatcat ccacactgca gcaaggtgat 
ctggagcttc tcatcagcat caatgtgaag 
cct.cacaacc tagaafcggto cccttccctc 
tcccacctgc ggccacactg tgcaacctgg 
catcaccaat tggatgcatc ttcbgcfcctg 
agatgcagcc caatcttctc ctaacatctg 
tcatcctgcc tttgbfcttct tgccctcctfc 
ttaaataaaa attcagatea fcccttgca 



cacctttgto accacactat gtccctgfcac 60 
tggtaactga ggcggagggg aaaggagggc 120 
ttcctgtgtc ctteatttcc actctgcgag 180 
aaggagocag tgacaagcsg aggagcaaac 240 
ataaaatcca cactgagctc tgcttaccag 300 
gqttccagca gcctcagcac caectgac.ac 360 
tctgccaaaa catatctcct. taaaagccaa 420 
ccaaaaatcc ttaggaggac agagggagtc 480 
cagccgcctc aactgtccac aggactctct 540 
aatttcccca cctgggcgga ctcaf.cacgt 600 
tgcagctggt gaaatctttc tcaacccttg 660 
gattcctctc tgtcactgca ttccctcctg 720 
tcfcctcccgg gcgataggca ttaactaaaa 780 
808 



<:210> 9 
<211> 51 

<2i2> mh 

<213> Homo sapiens 

<400> a 

tttatcacac tceatgatcc cagctkctaa aatccacacfc gagctctgct t 

<210> 10 

<2i2> DKA 

<:213> Homo sapiens 



<400> 10 

gtiggaaacct cagcctgctt ctcgtycggg ttgttagagg agtcatttag a 



<212> DNA 

< 2 3.3 s. Konio sapiens 

tatt ctcacggctt tcaagkgctc attttfccctg ctcatttatg : 



<2X2> DKA 

<213> Homo sapiens 



gt(:;igaggag tcatttagaa agctgkacoa ttctttaaat attctcaegg c 



<-,3ao> 13 

<;211> 51 

«;213> DNA 

<21^> Hcmo 33.piens 

<40C> ?-3 

ctcagcctgc ttcfccgtccg ggttgktaga ggagtcattt agaaagctgt a 

cSlC> 14 

<2ia> 51 

<212> DMA 

c213> Homo sapiens 

<4Q0> 14 

fcccgggttgt tagaggagtc atttaraaag afcgtaccatt ctttcaatat t 

«210> 15 

<211> 51 

<212> TMK 

<213> HoTdO sapiens 

<400» 15 

taccattctt tcaatattdt cacggyttta cagtgctcafc ttttcctgcfc c 

<2i0v 16 
<211> 51 
<212> DNA 

<213> So«o sapiens 
<400> 16 

gaaactgagc agcagcaggc ctgggkttgg cttfctaagha fcctatafctta a 

<2X0> 17 

<21%> 50 

<212> Dm 

<213> Homo sapiens 

<400> 17 

catattacta satctatttt tttttoagfcc tatcatccac actgcagcaa 



- 12 ~ 
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<211> 50 

<.2i2> -mh 

<213> Harac- sapiens 
<;400> 18 

tfcgfctttctt gccatccttt ctctcccggg tgstaggcat taacfcaaaat 

<210> 19 
t211> 50 

<213> Hostio sapiens 
•c400> 19 

gtttfccttgc cctcctttct ctcccgggtg ataggcatta actaaaatta 

<21Q> 2 0 

<211> SI 

<212> DNA 

<213> Homo sapiens 

<400> 20 

ggafcgcccta tctaaaaaac aaaaamcaaa aaaaaaaaag aaaaaagaaa a 

<210> 21 

<211> SI 

<212> DKA 

<213> Homo sapiens 

<400> 21 

>ntcgagagga tgccctatct aaaaamcaaa aaacaaaaaa aaaaaagaaa a 

<210> 22 
<211> SI 
<212> Dm 

<213> Homo sapiens . 
<400> 22 

aaaaatagta aaacaacaac aacaamaaaa aaacaaeaaa aaatecaaaa a 

<21C> 23 

<311> 50 

<212> mA 

<213> Homo sapiens 

<400>- 23 

caaccfcaaaa tatcgtcatg fcgtctttaaa aatgcatatt actaaatcta 

<Z10> 24 

<21X> 51 

<:212> im 

<213> Horao sapiens 

<400> 24 



- 13 - 



<210> 25 
<211> SI 

<.2v:i> mh 

<2i:i> Homo sapiens 

<400> 25 

iratgtccctg taccctacat gctgcrccta tmcccaggac cgatggtaaa t bl 

<210> 2S 

<211=. 51 

<312> DNA 

<:213> Homo sapiens 

cagatgattt caatggatac tagggwcctc tgtfcgcctcc fcctggcagag o 51 

<210> 27 
<2-il> 51 
<:212> Dim 

<-213> Homo sapiens 
<400.> 27 

taattcugtt ggtctggaat agtttktttt tfcccttttat tttttatttt t 51 

<21C}> 28 

<211> 51 

<212> »NA 

<213> Homo sapiens 

gacfcagagat gecaagcatc ttctcrtgtg tttatttgtg ctcttagagt t 51 

<210> 29 

<211> 51 

<212> DNA 

<213> Hotao sapiens 

<400> 29 . ' Ki 

acttgctgca tttcaaatgc ttggcrgtda catgtagtta gtggctaccc t 51 

<210> 30 

<211> SI 

<2i2> DKA 

<213> HotoD sapiens 

tccacaggac tctottccca qctgcrgcca cactgtgcaa cctggaattt a 51 



^213> Homo sapiens 



- 14 - 



^feidclfe ^S&lSic^^Slircgtc atcaccaatt ggatgcatct t SI 

<21C> 32 

<211> 51 

<212> JMA 

<213> Homo sapiens 

<400> 32 

tttttttttt ccttttattfc fcfctatwtttt tgagacagag fccttgctctg t St 

<Z10> 33 

<211> 51 

<212> Dim 

<213> Eosao sapiens 

<40O> 33 

aaaghgctcc tcaaccfcaaa atatcrtaat gtgtctttaa aaatgcatat t SI 

<210> 34 
^2ai> 51 

<213> Home sapiens 
<400> 34 

ggagcfctetc atcagcatca atgt^agoc aaaaatcctt aggaggaaag a 51 

<210> 35 

<211> 5a 

<312> OSA 

<213> Homo sapiens 

<400> 3S 

aagccaactg gagcttctea tcagcrtaaa tgfcgaagcca aaaatcctta g 51 

<310> 3S 

<2il> 51 

<212> I!iJA 

<213> Homo sapiens 

*:400> 36 

agcagtgcat gaggattctg atttckccac atccttgatg atacttgtfca t 51 

<21Q> 37 

<211> 51 

<212> BKA 

<;213> Homo sapiens 

<iOO> 37 ' ' 

ggcggaccca tcacgtcatc accaaytgga tgcatcttct gcfcotgfcgcsa g 51 

<210> 3 8 

<211> 51 

<212> DNA 

<213> HOTRO sapiens 



- 15 - 



rc^c4Ct:<x^tt -tt;qc,iiut5:t tagtasagcc aggattecac cafcgfctagcc a 

<210> 3S 
<211> Si 

<213> Homo sapiens 
<400> 3g 

tgttgatfctg ctgatgacta gagatrccaa gcatcttcfcc atgtgtttat t 

<210> 40 

<21X> 51 

<212> DHA 

<213> Homo sapieas 

<400> 40 

tccaaagcag ctataccatt ttacawtccc actagcagtg catgaggatt c 

<210> 41 

<211> 51 

<212> DHA 

<213> Eomo eapians 

<400> 41 

agagaaagaa tctgggeett acaggycacg ttggfcttaaa atttagacat c 

<210> 42 

<211> SI 

<212> Dm 

<213> Homo sapiens 

<40Q> 42 

tttaaaaatg catattacta astctrtttt ttttfccagfcc tatcatccao a 

<;210> 43 

<211> 51 

<212> DiSA 

<213> Homo sapiens 

<400> 4-3 

atcgatctcc tgagctcgtg atcfcgyocac cttggcttcc aaaagtggtg g 

<210> 44 

<21X> SI 

<212> DHA 

<313> Homo sapiens 

<4-00> 44 

fctcctgccct ccttttctcto ccgggkgata ggcattaact aaaattaaat a 

<210> 4 5 
<211> SI 
<212> DHA 



- 16 - 
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<400> 45 

gcrgccsittt i?.ggcaaaafcg gtttaiacatt gaatcaagga cattatgagc c 51 

<2iL">> 4L> 
-c 211> 51 

<2l?:> Homo sapiens 

tcaaacagag -cccaggcag ccaccraaag gtcttgaatg acagcttgfcc a 51 

<21Q> 47 

<2X1> 51 

<2X2> »WA 

<213> Homo sapiens 

<4 00> '^'^ 

cattccccta aatcagttgc atgagrccag cagtccatct ttgcattaat t 51 

<210> 48 
<2I2> 

«213> Homo sapiens 

<400> 48 ' 

atgcaactga tttaggggaa gggttygcct aaattaataa aagatcfcgaa t 51 

<210> 49 

<211> 51 

<2i2> Dm 

<213> Homo sapiens 

<400> 49 

tcctgtgtcc ttcatttcca ctctgygaga gfccfcgtgctg gaccctggag fc 5i 



<212> DMA 

<213» Homo sapiens 

<400> 50 

tttctctcca gggtgatagg cattatttctaa aafctaaataa aaattcagat c 

<210> 51 
<21l> 51 
<212> 

<213> Homo sapiens 
<400> 51 

ttgccctcct tfcctctcccg ggtgayaggc atfcaactaaa attaaataaa a 

<21Q> 52 
<211> 51 



- 17 - 



<400> 52 

ctgaggtggg aggatcaoot gagccsagga gfcatgaggct gcagtgagcc a 

<210s- 53 

<21i> 51 

<212> DHA 

<;213> HotGO sapiens 

-=;400> S3 

catatfcacta astctatfett ttttttcagt ctatcatcca cactgcagca a 

<210> S4 
<21l> 51 

<213> Homo sapiens 
<400> 54 

ttgfctttctfc gccctccttt ctctcfcccgg gtgataggca ttaactaaaa t 



<211> SI 

<212> DtiA 

<213> Homo sapiens 

<400> 5S 

gttttcttgc cctcctttct ctccctgggt gataggcatt aactaaaatt a 

<210> SS 

<211> S2 

<21U> DHA 

<213> Homo sapiens 

caacctaaaa tatcgtcatg tgtctattta aaaatgcata fctactaaatc ta 
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08041DUKS . TABLE. TX'r 
Table of Seqvsencas 



ID RO 


Clone Heme 




Type 




major: allele 


3926 


VfHA 


2 


minor allele 


3926 


Dm 


3 


major variant 


1231 


i"' rote, in 


4 


winos: vatiant 


1231 


Protein 


5 


primer 1 


23 


UNA 


6 




22 


DNA 


1 




107 


Protein 


8 


1,0(5387715 - 


308 


mA 


9 


rsl0490&24 


51 


OKA 


10 


rsl7623531 


51 


DNA 


11 


rsl27 81581 


51 


mh 


12 


rsl2781412 


51 


DM 


13 


rsi278l396 






14 


rsl2780331 


51 


D«A 


15 


rsi2780.157 


51 


Dm 


IS 


rsl2262'?59 


51 


mA 


17 


rsll412729 


50 


am 


18 


rsll412728 


50 


D55A 


IS 


SS11412727 


50 


Dm 


20 


rsll200636 


51 


Dm 


21 


rsll20C635 


51 


DKA 


22 


rsll200634 


51 


DNA 


23 


rsl066431S 




DSA 


24 


rsl0490924 


51 


DMA 


25 


1-310430923 


51 


DKA 


26 


rsl04SO822 


51 


DMA 


27 


rs9665334 


51 


0KA 


28 


rs3663144 


51 


DKA 
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29 


rs73157S3 


51 


mm 


30 


ra791S705 


51 


Dm 


31 


rs7S154S4 


33. 


Dm. 


32 




51 


Dm 


33 


rs7S3.2343 


51 


Dm 


34 


rs7912143 


51 


mh 


35 


rs7912135 


51 


0NA 


36 


rs7S11064 


51 


mh 


37 


i:s7S00895 


51 


mh 


38 


rs7898343 


SI 


DSfA 


39 


r37897S5C 


51 


Dnh 


40 


rs77894743 


51 


mih 


4,1 


rs7100813 


51 


DK& 


42 


rs7 088128 


51 


DKA 


43 


rs6585831 


SI 


DMA 


44 


rs4752S38 


51 


BNA 


45 


rs375084e 


51 


DNA 




rs3750847 


51 




4'? 


rs375D846 


51 


Dm 


48 


rs2?36912 


51 


ONA 


49 


rs2736911. 


51 


Dm 


50 


rs2672603 


51 


Dim 


51 


rs 2 672 502 


51 


m& 


52 


rs2672 600 


51 


DNA 


53 


rsll4i272& 


5X 


Dm 


54 


rsl3.412728 


51 


DNA 


55 


csil412727 


51 




56 


£310664316 


52 


Dm 
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